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Abstract 

A  longstanding  issue  in  attempts  to  understand  the  Everett  (Many- Worlds)  approach 
to  quantum  mechanics  is  the  origin  of  the  Born  rule:  why  is  the  probability  given  by  the 
square  of  the  amplitude?  Following  Vaidman,  we  note  that  observers  are  in  a  position  of  self- 
locating  uncertainty  during  the  period  between  the  branches  of  the  wave  function  splitting 
via  decoherence  and  the  observer  registering  the  outcome  of  the  measurement.  In  this  period 
it  is  tempting  to  regard  each  branch  as  equiprobablc,  but  we  argue  that  the  temptation 
should  be  resisted.  Applying  lessons  from  this  analysis,  we  demonstrate  (using  methods 
similar  to  those  of  Zurek’s  envariance-based  derivation)  that  the  Born  rule  is  the  uniquely 
rational  way  of  apportioning  credence  in  Everettian  quantum  mechanics.  In  doing  so,  we  rely 
on  a  single  key  principle:  changes  purely  to  the  environment  do  not  affect  the  probabilities 
one  ought  to  assign  to  measurement  outcomes  in  a  local  subsystem.  We  arrive  at  a  method 
for  assigning  probabilities  in  cases  that  involve  both  classical  and  quantum  self-locating 
uncertainty.  This  method  provides  unique  answers  to  quantum  Sleeping  Beauty  problems, 
as  well  as  a  well-defined  procedure  for  calculating  probabilities  in  quantum  cosmological 
multiverses  with  multiple  similar  observers. 
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1  Introduction 


The  Everett  (or  Many- Worlds)  approach  to  quantum  mechanics  is  distinguished  by  its  simplicity. 
The  dynamics  of  the  theory  consists  of  a  single  deterministic  evolution  law,  the  Schrodinger 
equation.  There  are  no  separate  rules  for  dealing  with  ‘wave  function  collapse’  and  ‘quantum 
measurement,’  nor  are  there  additional  hidden  variables.  A  longstanding  challenge  for  such  an 
approach  is  to  reproduce  the  Born  rule:  the  probability  of  a  measurement  yielding  eigenvalue 
a  of  the  observable  A,  given  that  the  system  was  prepared  in  state  is  |(a|'0)|2,  where  |a) 
is  the1  eigenstate  with  eigenvalue  a.  Indeed,  it  seems  like  quite  a  challenge  to  explain  how 
Everettian  quantum  mechanics  could  provide  a  theory  of  probabilities  at  all,  given  that  states 
can  be  specified  with  arbitrary  precision  and  all  evolution  is  perfectly  deterministic. 

In  this  paper  we  argue  that  probability  arises  in  Everettian  quantum  mechanics  because 
observers  with  perfect  knowledge  (of  their  immediate  circumstances  and  the  state  of  the  universe 
as  a  whole)  necessarily  evolve  into  conditions  of  self-locating  uncertainty,  in  which  they  do  not 
know  which  approximately  isolated  semi-classical  world  (or  ‘branch’)  they  inhabit.2  We  propose 
a  general  principle,  the  Epistemic  Separability  Principle  ( ESP ),  which  captures  the  idea  that 
predictions  made  by  local  agents  should  be  independent  of  their  environment.  Given  ESP,  we 
show  that  there  is  a  unique  rational  way  for  such  an  agent  to  assign  a  credence  to  each  of  the 
quantum  worlds  they  might  be  in.  These  credences  are  precisely  the  ones  recommended  by  the 
Born  rule.  The  probabilities  are  fundamentally  subjective  in  the  sense  that  that  they  are  not 
written  into  the  laws — as  they  are  in  spontaneous  collapse  theories—  but  instead  capture  the 
degrees  of  belief  of  a  rational  agent;  however,  they  are  objective  in  the  sense  that  a  rational 
agent  must  assign  Born  rule  probabilities  (if  ESP  is  correct). 

We  believe  that  recent  proofs  of  the  Born  rule  (Deutsch,  [1999];  Wallace,  [2010c];  Zurek, 
[2005])  are  on  the  right  track,  latching  onto  symmetries  in  the  theory  that  serve  to  explain  the 
validity  of  the  Born  rule  in  Everettian  quantum  mechanics.  However,  there  are  serious  objections 
to  the  approaches  already  explored  and  many  remain  unconvinced,  so  we  offer  this  derivation 
as  a  novel  alternative.  We  seek  to  provide  an  epistemic — as  opposed  to  a  decision-theoretic — 
derivation  of  the  Born  rule  which  connects  quantum  uncertainty  to  the  sort  of  self-locating 
uncertainty  present  in  very  large  universes  (discussed  recently  in  Hartle  &  Srednicki,  [2007]; 
Page,  [2007];  Srednicki  &  Hartle,  [2010]).  This  approach  shares  formal  features  with  Zurek’s 
([2003a];  [2003b];  [2005])  argument  based  on  the  idea  of  envariance,  while  offering  a  clearer 
explanation  of  the  way  in  which  probabilities  arise  in  a  deterministic  setting. 

At  first  glance,  standard  treatments  of  self-locating  uncertainty  seem  to  generate  a  serious 
problem  for  the  many- worlds  interpretation.  Elga  ([2004])  has  put  forward  a  compelling  principle 
of  indifference  for  cases  of  self-locating  uncertainty,  roughly:  an  observer  should  give  equal 
credence  to  any  one  of  a  discrete  set  of  locations  in  the  universe  that  are  consistent  with  the 
data  she  has.  A  simplistic  extension  to  the  quantum  case  would  seem  to  favor  treating  each 


1  Assuming  there  is  a  unique  eigenstate  with  eigenvalue  a. 

2The  increase  in  the  number  of  branches  over  time  is  a  consequence  of  unitary  evolution,  not  an  additional 
postulate  (Wallace,  [2003a],  [2010a]). 
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world  as  equiprobable  (branch-counting)  rather  than  the  Born  rule,  since  Everettian  observers 
on  different  branches  find  themselves  in  situations  of  identical  data.  We  believe  that  the  reasoning 
behind  Elga’s  principle,  when  properly  applied  to  Everettian  quantum  mechanics,  actually  leads 
to  the  Born  rule — not  branch-counting. 

One  common  assumption  appealed  to  in  determining  what  probabilities  one  ought  to  assign 
is  that  certain  features  of  the  case  are  irrelevant,  in  particular,  one  can  alter  the  environment  in 
many  ways  without  changing  what  probability  assignments  are  rational.  We  formalize  this  idea 
by  proposing  a  constraint  on  rational  credences  (ESP)  and  showing  that,  somewhat  surprisingly, 
the  principle  suffices  to  derive  the  Born  rule.  A  careful  derivation  will  be  provided  after  extensive 
stage-setting  in  §3.2  and  appendix  C,  but  the  spirit  of  the  approach  can  be  seen  in  this  simplified 
version  of  the  argument  for  two  equal  amplitude  branches.  Imagine  that  there  are  two  quantum 
measurement  devices  which  measure  the  2-spin  of  an  x-spin  up  particle,  the  first  displaying 
either  or  <0,  the  second  either  £  or  4|b  depending  on  the  result  of  the  measurement.  The 
post-measurement  state  of  the  detectors  can  be  written  as 

p)  | *)  +  |0)  |*)  •  (l) 

If  we  assume  there  is  an  experimenter  who  has  yet  to  observe  the  outcome,  they  ought  to  be 
uncertain  about  which  of  the  two  branches  they  are  on.  The  probabilities  assigned  to  the  possible 
states  of  detector  1  (O’/O)  should  be  the  same  if  the  case  is  modified  so  that  detector  2  is  wired 
to  show  the  opposite  symbols  from  those  in  (1)  as  the  state  of  the  second  detector  is  irrelevant 
to  to  the  experimenter’s  question  about  their  relation  to  the  first  detector, 

p)  A  +  |0)  |*)  .  (2) 

The  probability  of  in  (1) — which  is  the  probability  of  being  in  the  first  branch  of  (1) — is  the 
same  as  that  of  O  in  (2) — which  is  the  probability  of  being  in  the  first  branch  of  (2).  If  instead 
we  focus  on  assigning  probabilities  to  the  possible  states  of  detector  2  (Jh/J^),  they  too  should 
be  the  same  in  (1)  and  (2).  Focusing  on  4|l  yields  the  requirement  that  the  probability  of  being 
in  the  second  branch  of  (1)  be  the  same  as  the  probability  of  being  in  the  first  branch  of  (2). 
And  thus  the  probabilities  assigned  to  each  branch  of  (1)  must  be  equal. 

In  §2,  we  begin  by  introducing  the  many-worlds  interpretation  and  discussing  how  self- 
locating  uncertainty  arises  during  quantum  measurements.  If  we  extend  standard  methods  of 
handling  self-locating  uncertainty  to  quantum  cases  in  a  simple  way,  we  get  the  odd  recommen¬ 
dation  that  branches  should  be  weighted  equally  regardless  of  their  quantum  amplitudes.  We 
argue  that  this  cannot  be  the  right  way  to  handle  the  uncertainty  that  arises  in  cases  of  quantum 
branching.  In  §3  we  provide  an  alternative  principle  for  handling  self-locating  uncertainty  across 
physical  theories  (ESP)  and  show  that  it  yields  the  Born  rule  when  applied  to  Everettian  quan¬ 
tum  mechanics.  In  §4,  we  show  that  this  principle  (when  strengthened)  yields  the  result  that  each 
copy  of  oneself  should  be  judged  equiprobable  in  cases  of  classical  self-locating  uncertainty.  We 
then  use  the  principle  to  generate  probabilities  in  cases  which  involve  both  uncertainty  arising 
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from  quantum  measurement  and  from  duplication  of  one’s  experiences  elsewhere  in  the  universe, 
such  as  ‘quantum  sleeping  beauty’  scenarios  and  quantum  measurements  in  very  large  universes. 
Then,  in  §5  we  argue  that  although  in  Everettian  quantum  mechanics  there  is  nothing  to  be 
uncertain  of  before  measurement  (in  idealized  cases  where  the  wave  function  and  the  dynamics 
are  known),  we  can  still  generate  rules  for  rational  decision  making  pre-measurement  and  the¬ 
ory  testing  post-measurement.  In  §6  we  compare  our  treatment  with  the  approaches  of  Zurek, 
Deutsch,  and  Wallace. 

Before  diving  in,  a  choice  of  terminology.  In  this  article  we’ll  use  ‘universe’  to  denote  the 
collection  of  all  branches  as  described  by  the  universal  wave  function,  reserving  ‘world’  as  a  way 
of  denoting  a  particular  branch  of  the  wave  function,  i.e. ,  one  of  the  many  worlds  of  Everettian 
quantum  mechanics.  Using  this  terminology,  ‘quantum  multiverse’=‘uni verse.’ 

2  Preliminaries:  Many- worlds,  Self-locating  Uncertainty, 

and  Branch-counting 

2.1  The  Many- worlds  Interpretation 

If  the  state  of  the  universe  is  given  by  a  wave  function  and  that  wave  function’s  time  evolution  is 
at  all  times  in  accordance  with  the  Schrodinger  equation,  strange  things  start  to  happen.  Many  of 
us  have  come  to  accept  that  tiny  particles  can  be  in  superpositions  of  distinct  states,  e.g.,  passing 
through  two  slits  at  once.  But,  if  the  wave  function  always  obeys  the  Schrodinger  equation  then 
these  microscopic  superpositions  can  be  amplified,  leading  to  cases  where  macroscopic  measuring 
devices  and  even  human  beings  are  in  superpositions  of  distinct  states. 

Once  At  t\  Alice  has  prepared  particle  a  in  the  £-spin  up  eigenstate, 


(3) 


She  then  measures  the  z-spin  of  the  particle.  At  t±,  she  has  just  observed  the  outcome  of 
the  experiment.  If  the  particle  were  in  a  2-spin  eigenstate,  Alice  and  the  measuring  device 
would  simply  record  that  outcome.  But,  since  the  particle  is  in  a  superposition  of  z-spins, 
Alice  and  her  device  evolve  into  a  superposition  of  different  recorded  outcomes.  The  state 
of  Alice  (A),  her  particle  (a),  the  measuring  device  ( D ),  and  the  rest  of  the  universe  evolve 
as  below.  Here  R  indicates  the  ready  state  and  f/1  denote  the  possible  outcomes. 


I’J'Once(U)}  —  | R) A  I R) D  ltx}0  \Eq) 

l-JW^))  =  ^  \t)A  lt)D  lt*>„  \Ei)  +  \i)A  iw„  \E2) 


(4) 


In  I^once^))  both  Alice  and  the  detector  have  entered  superpositions  of  recording  up  and 
down  as  the  result  of  the  measurement.  In  practice,  just  one  outcome  is  observed.  One  could 
modify  the  theory  to  ensure  that  only  a  single  outcome  actually  occurs.  This  is  the  strategy 
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taken  by  Bohmian  mechanics,  GRW  theory,  and  other  venerable  responses  to  the  measurement 
problem.  However,  such  a  modification  may  be  unnecessary.  According  to  the  many-worlds 
interpretation,  I^once^))  is  a  state  where  there  are  two  (or  more)  copies  of  Alice,  each  of  which 
has  just  observed  the  readout  of  a  2-spin  measurement.  There’s  a  version  of  Alice  who  saw  up 
and  a  version  who  saw  down,  but  no  version  that  saw  both  up  and  down.  Seeing  a  single  definite 
outcome  is  not  precluded  by  the  state  evolution  given  above,  it  is  guaranteed  since  every  one  of 
Alice’s  successors  sees  a  definite  outcome. 

Even  if  each  copy  of  Alice  has  an  ordinary  experience  at  £4,  one  still  might  be  worried 
about  the  fact  that  there  exist  parts  of  the  universe  in  which  each  possible  outcome  happens. 
Particularly,  one  might  hope  that  after  a  measurement  like  this  is  made  and  Alice  dutifully 
records  up  as  the  result,  she  can  forget  about  the  fact  that  the  particle  might  have  been  down 
for  the  sake  of  all  her  future  calculations.  But,  if  the  state  of  the  universe  is  as  in  (4),  then  the 
possibility  that  the  result  might  have  been  down  is  no  mere  possibility,  that  part  of  the  wave 
function  is  still  out  there  and  could  potentially  interfere  with  Alice’s  part.  If  the  many-worlds 
interpretation  is  to  be  viable,  there  must  be  some  reason  why  Alice  can  ignore  the  other  worlds 
in  which  the  experiment  turned  out  differently.  Fortunately,  there  is:  decoherence. 

As  the  result  of  the  measurement  is  recorded  in  the  device  and  observed  by  Alice,  many 
traces  of  this  result  will  appear  in  the  environment.  Due  to  the  large  number  of  traces  present 
in  the  environment,  the  states  E\  and  £2  are  (at  least  approximately)  orthogonal  and  can 
be  expected  to  stay  orthogonal  as  time  progresses.  Because  these  environment  states  are  and 
will  be  orthogonal,  the  two  components  of  |\&once(^4))  will  evolve  in  a  non-interacting  way,  each 
component  evolving  in  accordance  with  the  Schrodinger  equation  as  if  the  other  were  not  present. 
They  can  thus  be  treated  as  separate  worlds  (or  collections  of  worlds)  since  they  are  effectively 
causally  isolated  and  within  each  of  them  there  are  versions  of  Alice  having  clear  and  determinate 
experiences.3 

In  quantum  mechanics,  the  state  of  a  subsystem  is  given  by  a  reduced  density  matrix,  gen¬ 
erated  by  taking  a  partial  trace  over  the  Hilbert  space  of  the  environment  (Schlosshauer,  [2007], 
§2.4;  Nielsen  &  Chuang,  [2010],  §2.4).  In  ordinary  cases  of  quantum  measurement,  the  process 
of  decoherence  diagonalizes  the  reduced  density  matrix  for  the  macroscopic  degrees  of  freedom 
describing  our  observer  and  measuring  apparatus  in  a  very  specific  basis,  the  so-called  pointer 
basis  (Zurek,  [1981],  [1982],  [1993],  [2003a]).  The  pointer  basis  is  distinguished  by  being  robust 
with  respect  to  ordinary  interactions  with  the  environment. 


3For  more  on  how  Everettian  branches  arise  from  decoherence,  see  (Schlosshauer,  [2005];  Saunders,  [2010a], 
§1.3)  and  references  therein.  One  might  object  that  this  sort  of  story  about  the  emergence  of  separate  worlds 
cannot  be  assumed  in  a  derivation  of  the  Born  rule.  See  appendix  B. 
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The  state  of  Alice,  the  detector,  and  particle  a  at  t\  can  be  written  as  a  reduced  density 
matrix  by  tracing  out  the  environment, 

PADa(tl )  =  TVe^  |$o  NCE  (h))  (*o  nce(^i) I ^ 

=  2  \R)a  \R)d  ltz)a  (tz|a  {R\d  (R\a  +  9  \R)a  \R)d  ltz)a  (4-zla  (R\ D  ( R\a 

A  9  \R)d  l'l'z)a  (tz|a  (R\d  (R\a  A  7^  |-^)a  \R) D  l-l-z)a  (4-z|a  {R\ D  (R\a  ■  (®) 

As  the  0-spin  of  the  particle  has  yet  to  be  entangled  with  the  measuring  device  and  the  environ¬ 
ment,  the  diagonal  terms  of  the  reduced  density  matrix  (the  middle  two  terms  of  the  sum  in  the 
second  line  of  (5))  are  non-zero  and  potentially  very  relevant  to  the  evolution  of  the  system — 
e.g.,  if  the  :r-spin  of  the  particle  were  to  be  measured.  After  the  measurement  is  made  and  the 
environment  gets  entangled  with  the  particle’s  spin,  the  reduced  density  matrix  becomes 

PADaih)  =  \  (-El  |  El)  \t)A  |t)D  It  (t*|0  (tlD  (t|A  +  \  (^ll^2)  \t)A  lt)D  |t*)„  (tz|a  (tin  UU 

+  x-  <£b|Ei)  \i)A  |4.)d  ||,)a  (Ua  (tlB  (tu  +  \  (^2|e2)  14.) a  |4.)d  I4.*)a  UzL  (i\D  UU 

=  2  It)^  lt)_D  ltz)a  (tz|a  (tin  (tlx  +  2  It) A  lt)r>  |tz)a  (tz|a  (tin  UIa  ■  (®) 

Here  we’ve  assumed  for  simplicity  that  E\  and  E2  are  perfectly  orthogonal,  (-E1I-E2)  =  0,  and  the 
diagonal  terms  vanish;  in  realistic  cases  (£9  |  A2)  is  very  close  to  zero.  The  diagonal  terms  have 
dropped  out  and  the  state  is  now  decomposable  into  two  parts — branches — of  the  reduced  density 
matrix,  one  in  which  the  result  was  up  and  a  second  in  which  it  was  down.  It  is  common  to  speak 
of  the  universal  wave  function  splitting  into  several  branches  upon  measurement,  components 
that  will  no  longer  interact  because  of  decoherence.4  Similarly,  one  can  just  as  easily  speak  of  the 
density  matrix  for  the  universe  as  being  decomposable  into  branches.  Reduced  density  matrices 
can  also  evolve  into  pieces  that  because  of  decoherence  will  no  longer  interact.  We  call  these 
pieces  ‘branches’  too,  although  this  is  a  non-standard  use  of  the  term  ‘branch.’  Branches  of  a 
reduced  density  matrix  should  not  be  confused  with  branches  of  the  universal  wave  function. 
A  branch  of  a  reduced  density  matrix  may  correspond  to  many  branches  of  the  universal  wave 
function,  as  occurs  in  the  possible  elaboration  of  Once  below. 

Once-OR-Twice  Alice’s  particle  (a)  and  Bob’s  particle  (6)  are  both  initially  prepared  in  the 
i-spin  up  eigenstate.  Alice’s  device  measures  the  2:-spin  of  her  particle  first.  Then,  Bob’s 
device,  which  is  connected  to  Alice’s,  measures  0-spin  of  particle  b  only  if  particle  a  was 
measured  to  have  0-spin  up.  By  t±  the  setup  is  prepared;  by  t2  Alice’s  particle  has  been 
measured  but  Bob’s  has  not;  by  t3  both  particles  have  been  measured.  Bob  has  been 
watching  as  the  results  of  the  experiments  are  recorded.  Up  through  £3 ,  Alice  has  not 
looked  at  the  measuring  devices  and  is  unaware  of  the  results.  By  £4,  Alice  has  looked 


4Although  we  will  use  the  language  of  ‘branching’  throughout  the  paper,  we  do  not  mean  by  this  to  prejudge 
the  issue  of  whether  these  branches  diverge  or  overlap  (discussed  in  Saunders,  [2010b];  Wilson,  [2012]).  Our 
strategy  can  be  implemented  on  either  picture  (see  footnote  28). 
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at  her  device  and  seen  the  result  of  the  measurement  of  particle  a,  although  she  remains 
ignorant  about  the  z-spin  of  particle  b.  The  branching  structure  of  this  scenario  is  shown 
in  figure  1.  The  evolution  of  the  state  is  shown  below.  Here  Bob  ( B )  and  his  particle  ( b ) 
have  been  pulled  out  of  the  environment,  but  otherwise  the  state  is  broken  down  as  in  (4) 
(Bob’s  device  is  treated  as  part  of  the  environment;  X  indicates  that  the  measurement  was 
not  made). 


l*OT(ti)>  =  \R)a  I R)d  ltx)a  I R)b  |t*)b \Er) 

l*OT(i2)>  =  I R)a  \t)D  ltz)a  |t)B  \Ub  \Et)  +  \R)a  I4->D  \iz)a  IDs  |tx)6  |^) 

l*OT(i3)>  =  \  I R)a  lt)c  It*)„  It,  t)B  It*)* \Eft)  +  \  I R)a  lt)B  |t*)„  It4)s  14-*)*  l^u) 

+  I R) A  It)!)  |4-z)a  It,  X)B  |tx)b  l-^cpc) 

I IkoT (^4))  =  2  It) A  It) Z5  ltz)Q  lt,t)s  Its)*  |^tt)  2  l^a  lt>t)s  It*)*  l^tt) 

+  ^tMI)DlUaltX)B|t,)b|i?ix)  (t 


If  we  focus  on  the  state  of  Alice,  the  detector,  and  particle  a  at  t±,  the  reduced  density 
matrix  for  Once-Or-Twice  will  be  exactly  as  it  was  in  Once  (6).  However,  this  is  clearly 
a  case  where  there  are  at  least  three  branches  of  the  universal  wave  function  corresponding  to 
the  three  different  possible  outcomes  of  the  series  of  experiments  as  recorded  by  Bob.  In  this 
case,  the  branch  of  pADa  in  which  Alice  observes  up  corresponds  to  at  least  two  branches  of  the 
universal  wave  function,  as  there  are  two  possible  outcomes  Bob  might  have  observed. 


2.2  Self-locating  Uncertainty  and  the  Everettian  Multiverse 

On  many  interpretations  of  quantum  mechanics,  quantum  measurements  yield  just  one  result. 
Before  a  measurement  is  made,  the  experimenter  is  typically  uncertain  about  which  outcome  will 
actually  occur,  even  if  he  knows  as  much  about  the  universal  wave  function  as  could  be  helpful. 
This  uncertainty  can  be  quantified  using  the  Born  rule.  In  the  many-worlds  interpretation,  if 
we  assume  that  the  experimenter  knows  the  relevant  information  about  the  wave  function,  it 
is  unclear  what  the  agent  might  be  uncertain  of  before  a  measurement  is  made.  They  know 
that  every  outcome  will  occur  and  that  they  will  have  a  successor  who  sees  each  possible  result. 
Before  we  can  discuss  the  quantitative  problem  of  what  numerical  probabilities  an  agent  in  an 
Everettian  multiverse  should  assign  to  different  outcomes,  we  must  answer  the  question:  What 
can  one  assign  probabilities  to?  Our  answer  will  be  that  agents  performing  measurements  pass 
through  a  period  of  self-locating  uncertainty,  in  which  they  can  assign  probabilities  to  being  one 
of  several  identical  copies,  each  on  a  different  branch  of  the  wave  function. 

Consider  Alice  in  Once-OR-Twice.  At  time  <2  the  state  can  be  written  in  either  of  two 
equivalent  forms, 

l*OT(i2)>  =  -j=  | R)a  ( \t)D  Itz)„  |t>B  lt*>6  \Et)  +  \i)D  IWa  |4.)b  lt*>6 1£|>  )  (8) 

=  lt)_D  ltz)a  lt)_B  ltx)f,  |-®f)  +  I E) A  I4-)d  14-2)0  |4-)S  ltx)6  l-E't)  ■  (9) 

In  the  first  way  of  writing  the  state,  it  appears  as  if  there  is  only  one  copy  of  Alice  (represented  by 
while  the  rest  of  the  state  has  branched  in  two.  But  in  the  second  expression,  it  appears 
as  if  there  are  two  branches  with  two  identical  copies  of  Alice.  (8)  and  (9)  are  manifestly 
mathematically  identical,  but  suggest  different  physical  pictures,  corresponding  to  two  plausible 
attitudes  about  wave  function  branching.  One  attitude  would  be  that  Alice  herself  doesn’t  branch 
until  her  own  reduced  density  matrix  splits  into  multiple  distinct  branches  (in  the  pointer  basis) 
sometime  between  t 3  and  £4;  in  that  case  (8)  gives  the  right  physical  picture,  and  there  is  only 
one  copy  of  Alice  even  after  the  first  measurement  has  occurred.  The  other  attitude  is  that 
branching  happens  throughout  the  whole  wave  function  whenever  it  happens  anywhere.  When 
the  universal  wave  function  splits  into  multiple  distinct  and  effectively  non-interacting  parts,  the 
entire  world  splits — along  with  every  object  and  agent  in  it.  Then  (9)  is  the  more  perspicuous 
way  of  writing  the  state.  There  are  two  identical  post-measurement  copies  of  Alice,  one  on  each 
branch.5 

We  will  work  under  the  assumption  that  the  second  attitude  (branching  happens  throughout 
the  wave  function  whenever  it  happens  anywhere)  is  the  right  way  to  think  about  the  process.6 
The  branching  structure  of  the  wave  function  in  Everettian  quantum  mechanics  is  not  uniquely 
defined  by  the  theory,  which  is  just  a  state  space  and  a  dynamical  law;  like  coarse-graining 


5Wallace  ([2012],  §8.6)  advocates  a  version  of  the  first  picture  and  Vaidman  ([2014a],  [2014b],  [forthcoming]) 
the  second. 

bIn  our  derivation  of  the  Born  rule,  §3.2,  this  second  attitude  plays  an  important  role  in  allowing  us  to  examine 
cases  like  (17)  where  the  agent  is  in  a  physically  identical  state  on  each  branch. 
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in  classical  statistical  mechanics,  there  is  a  subjective  element  in  how  it  is  constructed.  We 
must  decide  what  kind  (and  what  degree)  of  separation  is  required  for  two  putative  branches 
to  count  as  distinct  (this  involves  choosing  an  appropriate  system/environment  split,  or,  in  a 
consistent-histories  approach,  choosing  which  coarse-grained  observables  to  use).  However,  treat¬ 
ing  branching  on  a  subsystenr-by-subsystenr  basis — as  advised  by  the  first  attitude — is  needlessly 
arbitrary;  it  depends  on  how  the  universe  is  carved  into  subsystems.  Some  very  small  piece  of  a 
post-measurement  detector  might  remain  in  the  same  state  on  every  branch,  but  it  seems  wrong 
to  think  that  whether  the  single  piece  has  branched  into  many  depends  on  whether  it  is  consid¬ 
ered  as  part  of  the  detector  or  as  a  separate  subsystem  in  its  own  right.  For  Alice’s  purposes, 
it  makes  sense  to  distinguish  between  the  two  copies  of  herself  in  (9);  even  though  they  are 
currently  unentangled  with  the  rest  of  the  state,  in  the  future  they  will  evolve  differently  (and 
independently  of  one  another)  as  Alice  becomes  aware  of  the  measurement  outcome.  (In  partic¬ 
ular,  she  is  allowed  to  make  that  distinction,  and  doing  so  enables  her  to  assign  probabilities  to 
measurement  outcomes;  probabilities  which  we  will  show  must  obey  the  Born  rule.) 

The  non-local  nature  of  the  globally-branching  view  might  cause  some  discomfort.  It  implies 
that  observers  here  on  Earth  could  be  (and  almost  surely  are)  branching  all  the  time,  with¬ 
out  noticing  it,  due  to  quantum  evolution  of  systems  in  the  Andromeda  Galaxy  and  elsewhere 
throughout  the  universe.'  We  take  this  to  be  one  among  many  psychologically  unintuitive  but 
empirically  benign  consequences  of  Everettian  quantum  mechanics. 

There  are  two  copies  of  Alice  at  t2  in  Once-OR-Twice.  Each  copy  can  reasonably  wonder 
which  one  she  is.  Thus  even  if  she  (incredibly)  knows  the  universal  wave  function  exactly,  Alice 
still  has  something  to  be  uncertain  of.  She  isn’t  uncertain  about  the  way  the  universe  is;  by 
supposition  she  knows  the  wave  function  and  this  gives  a  complete  specification  of  the  state 
of  the  universe.  Alice  is  uncertain  about  where  she  is  in  the  quantum  multiverse  (as  has  been 
emphasized  by  Vaidman  [1998],  [2011],  [2014a],  [2014b]).  She  doesn’t  know  if  she’s  in  the  branch 
of  the  wave  function  in  which  the  detector  displays  up  or  the  one  in  which  it  shows  down.  We 
say  that  Alice  has  self-locating  uncertainty  (see  Lewis,  [1979];  Bostrom,  [2002]).  We  call  this 
period  in  which  self-locating  uncertainty  is  present,  after  the  measurement  has  been  made  and 
branching  has  occurred  via  decoherence  but  before  the  experimenter  has  registered  the  result, 
the  ‘post-measurement  pre-observation’  period. 

During  the  post-measurement  pre-observation  there  are  multiple  copies  of  Alice  seeing  differ¬ 
ent  results.  Are  any  or  all  of  them  the  same  person  as  Alice  before  the  measurement?  This  is  a 
tricky  metaphysical  question  upon  which  we  will  not  speculate.  We  will  indiscriminately  refer  to 
a  post-branching  version  of  Alice  as  a  ‘copy,’  or  a  ‘successor,’  or  simply  ‘Alice.’  In  appealing  to 
self-locating  uncertainty  during  the  post-measurement  pre-observation  period,  all  that  is  needed 
is  that  each  copy  of  Alice  can  coherently  wonder  what  sort  of  branch  they  inhabit. 

At  this  point  it  may  be  objected  that  Once-OR-Twice  is  highly  atypical  as  Alice  is  for  a  time 


7However,  the  reduced  density  matrix  describing  the  state  of  the  observer  on  Earth  is  unaffected  by  the  events 
occurring  in  the  Andromeda  galaxy.  In  a  relativistic  context,  one  natural  option  would  be  to  say  that  although 
the  number  of  copies  of  an  observer  is  frame-relative,  the  probabilities  assigned  to  measurement  outcomes  are  not 
(as  the  reduced  density  matrix  is  all  that  is  relevant  to  the  calculation  of  probabilities,  see  §3.1). 
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unaware  of  the  measurement  result.  Actually,  self- locating  uncertainty  is  generic  in  quantum 
measurement.  In  Everettian  quantum  mechanics  the  wave  function  branches  when  the  system 
becomes  sufficiently  entangled  with  the  environment  to  produce  decoherence.  The  normal  case 
is  one  in  which  the  quantum  system  interacts  with  an  experimental  apparatus  (cloud  chamber, 
Geiger  counter,  electron  microscope,  or  what  have  you)  and  then  the  observer  sees  what  the 
apparatus  has  recorded.  For  any  realistic  room-temperature  experimental  apparatus,  the  deco¬ 
herence  time  is  extremely  short:  less  than  10“20  seconds.  Even  if  a  human  observer  looks  at 
the  quantum  system  directly,  the  state  of  the  observer’s  eyeballs  will  decohere  in  a  comparable 
time.  In  contrast,  the  time  it  takes  a  brain  to  process  a  thought  is  measured  in  tens  of  millisec¬ 
onds.  No  matter  what  we  do,  real  observers  will  find  themselves  in  a  situation  of  self-locating 
uncertainty  (after  decoherence,  before  the  measurement  outcome  has  been  registered).8  The 
observer  may  not  have  enough  time  to  think  and  reorganize  their  credences  before  learning  the 
outcome  (Wallace,  [2006],  §4.2),  but,  in  trying  to  approximate  ideal  rationality,  the  agent  can  at¬ 
tempt  to  reconstruct  the  probabilities  they  ought  to  have  assigned  during  the  post-measurement 
pre-observation  period.  Despite  often  being  short-lived,  these  probabilities  are  relevant  for  de¬ 
ciding  what  to  believe  after  seeing  the  measurement  outcome,  §5.2,  and  deciding  how  to  act  in 
anticipation  of  the  measurement,  §5.1. 

Self-locating  uncertainty  has  been  discussed  in  great  depth  by  formal  epistemologists.  The 
general  question  of  how  to  extend  Bayesian  updating  to  cases  involving  self-locating  uncertainty 
is  very  difficult  and  a  matter  of  contemporary  philosophical  debate,  but  many  different  proposals9 
agree  on  one  minimal  constraint  on  rational  credences  (a.k.a.  subjective  probabilities  or  degrees 
of  belief): 

Indifference  Suppose  that  in  a  given  possible  universe  U,  agent  S  has  a  finite  number  Njj  of 
copies  in  an  internally  qualitatively  identical  state  to  S.  Further,  suppose  that  the  hypoth¬ 
esis  H  is  true  for  Njjh  of  these  copies.  Then,  S  should  assign  the  following  conditional 
credence  to  H  being  true  given  that  S  is  in  U: 

P{H\U)  =  ^  (10) 

Indifference  was  originally  put  forward  in  (Elga,  [2004]);  the  version  here  is  from  (Manley, 
[unpublished])  (paraphrased).  Two  agents  are  in  the  same  internal  qualitative  state  if  they  have 
identical  current  evidence:  the  patterns  of  colors  in  their  visual  fields  are  identical,  they  recall 
the  same  apparent  memories,  they  both  feel  equally  hungry,  etc.  This  principle  should  not  be 
confused  with  a  Laplacean  ‘principle  of  indifference.’ 

To  see  Indifference  in  action,  consider  the  following  case  from  Elga  ([2004]): 

Duplicating  Dr.  Evil  Dr.  Evil  is  plotting  the  destruction  of  Earth  from  his  lunar  battle  sta¬ 
tion  when  he  receives  an  unwelcome  message.  Back  on  earth  some  pesky  philosophers  have 


8Although  we  think  that  self-locating  uncertainty  is  typical,  it  may  not  be  necessary  for  our  account  that  it  is 
always  present.  See  the  end  of  §5.2. 

9In  particular,  we’re  thinking  of  the  proposals  of  (Bostrom,  [2002];  Meacham,  [2008];  Manley,  [unpublished]) 
as  formulated  in  (Manley,  [unpublished]). 
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duplicated  the  entirety  of  his  battle  station,  perfectly  replicating  every  piece  of  furniture, 
every  weapon,  and  every  piece  of  food,  even  replicating  the  stale  moon  air  and  somehow 
the  weaker  gravitational  field.  They  went  so  far  that  at  t  they  created  a  duplicate  of  Dr. 
Evil  himself,  Dup.  Dup’s  immediate  experiences  are  internally  identical  to  Dr.  Evil’s.  The 
two  men  pace,  think,  and  scratch  themselves  in  perfect  synchronicity. 

Upon  learning  of  his  duplication,  what  credence  should  Dr.  Evil  assign  to  being  in  his  lunar 
base  and  not  the  terrestrial  fake?  According  to  Indifference ,  Dr.  Evil  should  assign  a  credence 
of  one  half  (and  Dup  should  do  the  same).  Dr.  Evil  knows  that  in  any  universe  U  he  might 
inhabit  there  are  two  copies  of  himself  (Njj  =  2)  and  that  one  is  on  the  moon  (Njjh  =  1  for 
H  =  ‘I  am  on  the  moon.’). 

2.3  Indifference  and  the  Quantitative  Probability  Problem 

In  the  previous  section  we  showed  that  in  the  post-measurement  pre-observation  period  it  is 
possible  to  assign  probabilities  to  the  different  possible  outcomes  of  a  completed  experiment. 
But,  what  probabilities  ought  one  to  assign?  If  you  take  Indifference  to  be  a  constraint  on 
rational  probability  assignments  and  count  agents  in  the  most  naive  way,  there  appears  to  be  a 
simple  rule  for  assigning  probabilities:  branch-counting.10  The  probability  that  H  is  true  is  given 
by  the  fraction  of  branches  in  which  H  holds.  Since  there  is  a  single  copy  of  the  experimenter  on 
each  branch,  the  fraction  of  branches  is  the  same  as  the  fraction  of  agents  and  branch-counting 
is  a  direct  consequence  of  Indifference. 

There  is  a  well-known  problem  for  this  kind  of  approach.* 11  The  number  of  branches  in 
which  a  certain  outcome  occurs  might  not  be  well-defined.  Branches  are  structures  in  the  wave 
function  which  emerge  via  decoherence,  and  although  this  process  will  guarantee  that  on  each 
branch  a  definite  outcome  is  observed,  it  will  generally  not  yield  a  good  answer  to  the  question 
of  how  many  branches  feature  a  certain  outcome  or  even  how  many  branches  there  are  in  total. 
In  contrast,  the  total  mod-squared  amplitude  of  branches  in  which  a  certain  outcome  occurs  is 
well-defined  (although  not  to  arbitrary  precision).  The  difficulty  of  branch-counting  provides 
good  reason  to  think  Indifference  will  often  be  unhelpful  in  Everettian  scenarios.  However, 
Indifference  does  give  clear  recommendations  in  idealized  scenarios,  so  for  the  time  being  we’ll 
restrict  our  attention  to  cases  where  the  number  of  branches  is  well-defined. 

Assume  that  Once-OR-Twice  really  is  a  case  in  which  one  branch  becomes  three,  as  depicted 
in  figure  1.  At  ,  what  probability  should  Alice  assign  to  particle  a  having  been  measured  to 
be  z-spin  down?  By  the  methods  of  textbook  quantum  mechanics,  we  should  use  the  Born  rule 
to  calculate  the  probability.  That  is,  we  should  square  the  coefficient  of  the  third  branch  of  the 
wave  function  in  (7), 


(11) 


To  apply  Indifference  we  simply  need  to  determine  the  fraction  of  branches  in  which  the  result 


10Lewis  ([2009a])  considers  a  proposal  along  these  lines. 

11For  more  in-depth  discussion,  see  Wallace  ([2007],  §9,  [2012],  §3.11). 
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of  the  first  measurement  was  down, 


Pt3  (down) 


number  of  down  branches 
number  of  branches 


1 

3  ’ 


(12) 


the  coefficients  of  1/2,  1/2,  and  l/-\/2  of  the  branches  in  I’I'oTfe))  are  irrelevant  to  the  cal¬ 
culation.  Supposing  Indifference  is  true  and  people  are  to  be  counted  in  this  straightforward 
way,  textbook  quantum  mechanics  and  Everettian  quantum  mechanics  make  different  predictions 
about  the  probability  of  seeing  down.  Thus,  we  can  empirically  test  the  many-worlds  interpre¬ 
tation  in  any  case  where  the  fraction  of  branches  in  which  some  hypothesis  H  is  true  is  unequal 
to  the  combined  weights  of  branches  in  which  H  is  true  (the  generic  case) . 

We  now  have  the  strongest  kind  of  argument  against  Everettian  quantum  mechanics:  it  is 
empirically  inadequate.  The  theory  fails  to  reproduce  the  empirical  predictions  of  textbook  quan¬ 
tum  mechanics  and  thus  has  been  conclusively  falsified  by  the  data  typically  taken  as  evidence 
for  quantum  mechanics.12  However,  this  argument  relies  on  the  assumption  that  Indifference 
is  a  universally  valid  principle  applicable  to  Everettian  multiverses  as  well  as  cases  of  classical 
duplication  like  Duplicating  Dr.  Evil.  As  we  will  see,  it  is  not. 


2.4  Against  Branch-counting 

When  Indifference  is  applied  to  Once-OR-Twice,  something  odd  happens.  At  ^  the  principle 
advises  each  copy  of  Alice  to  assign  a  probability  of  one  half  to  particle  a  being  down  as  half 
of  the  copies  of  Alice  are  in  a  branch  where  the  particle  was  measured  to  be  down.  Then,  at 
^3  Alice  is  supposed  to  assign  down  a  probability  of  one  third  (as  in  (12)).  According  to  the 
Born  rule,  the  recommended  probability  assignment  is  one  half  at  both  t2  and  1 3.  If  Indifference 
is  right,  there’s  a  strange  switch  in  the  probabilities  between  t2  and  1 3.  Is  there  any  reason  to 
think  this  undermines  the  branch-counting  strategy  advocated  by  Indifference ?  David  Wallace 
has  argued  that  such  a  switch  violates  a  constraint  he  calls  ‘diachronic  consistency.’  In  appendix 
A,  we  argue  that  this  is  not  the  right  diagnosis  of  the  problem  with  the  switch  in  credences.  This 
kind  of  ‘inconsistency’  is  a  common  result  of  Indifference  and  not  something  which  should  be 
taken  to  refute  the  principle.  Still,  we  agree  that  there’s  something  wrong  with  the  probability 
switch. 

Between  £2  and  what  happens?  Particle  b  is  measured  and  Bob  takes  note  of  the  result. 
Nothing  happens  to  Alice,  particle  a,  or  Alice’s  device.  If  nothing  about  Alice  or  her  detector 
changes,  why  should  her  degree  of  belief  that  she  bears  a  certain  relation  to  the  detector  change? 
At  t*2 ,  the  state  of  Alice  and  her  detector  is  given  by, 

Pad (^2)  =  2  \R)a  lt)i)  (t Id  (-^1  a  +  ^  I R)a  I4-)d  (-Ud  (-^Ia  •  (13) 

At  this  time  Alice  is  trying  to  locate  herself  in  one  of  the  two  branches  of  this  Alice+Detector 


12 Admittedly,  the  force  of  this  argument  is  weakened  by  the  fact  that  we  are  working  under  an  idealization. 
Branch  number  may  not  be  well-defined  in  realistic  cases  and  then  it  becomes  unclear  what  probabilities  the 
proponent  of  Indifference  thinks  one  should  assign. 
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system  and,  if  she  applies  Indifference ,  decides  that  she  should  assign  a  probability  of  one  half 
to  each  branch.  At  £3,  the  state  of  the  Alice+Detector  system  is  unchanged,  it  is  still  exactly  as 
in  (13).  Now  she  decides  that  the  probability  of  being  in  the  second  branch  is  one  third.  Why 
should  her  probability  for  being  in  different  subsystems  (branches  of  pad)  of  the  Alice+Detector 
system  change  when  nothing  about  that  system  changes  and  she  knows  that  she  is  somewhere  in 
that  system?  It  shouldn’t.  In  the  next  section  we  will  use  this  insight  to  motivate  a  replacement 
for  Indifference. 

It  is  tempting  to  think  that  the  number  of  copies  of  Alice  cannot  change  without  her  physical 
state  changing — this  is  the  way  things  work  in  classical  physics.  But,  in  Everettian  quantum 
mechanics,  changes  that  purely  affect  her  environment  can  change  the  number  of  copies  of  Alice 
in  existence.  For  example,  the  change  of  state  from  to  £3  in  Once-OR-Twice,  (7).  Two 
intuitive  constraints  come  into  conflict:  Indifference ,  and  the  belief  that  Alice’s  probabilities 
should  be  unaffected  by  changes  in  the  state  of  her  environment.  We  recommend  rejecting  the 
former  in  favor  of  the  latter. 


3  The  Epistemic  Irrelevance  of  the  Environment 

3.1  The  Epistemic  Separability  Principle 

In  §2.4  we  argued  that  branch-counting  is  unreasonable  because  it  requires  Alice  to  change  her 
credences  about  the  result  of  the  measurement  of  particle  a  when  things  change  elsewhere  in 
the  universe.  In  particular,  when  particle  b  is  measured.  In  the  Duplicating  Dr.  Evil 
case,  whatever  one  thinks  of  Indifference ,  it  seems  clear  that  certain  facts  are  irrelevant.  The 
probability  P(Tm  Dr.  Evil|I7)  should  not  depend  on  what’s  happening  deep  inside  the  Earth’s 
core  or  what’s  happening  on  the  distant  planet  Neptune  or  any  other  remote  occurrences.  Unless, 
of  course,  the  actions  of  the  Earth’s  core  cause  earthquakes  which  the  terrestrial  Dup  feels  but 
the  lunar  Dr.  Evil  does  not.  Or,  if  what’s  happening  on  Neptune  includes  another  copy  of  the 
laboratory  with  another  duplicate  Dr.  Evil  in  it.  If  there’s  a  duplicate  on  Neptune,  Dr. Evil 
can  no  longer  be  sure  that  Neptune  is  in  fact  a  distant  planet  and  not  the  one  under  his  feet 
(and  thus  cannot  treat  it  as  irrelevant  to  his  probability  assignments).  As  long  as  the  copies  of 
Dr.  Evil  are  unaffected  and  no  new  copies  are  created  elsewhere,  P(I’m  Dr.  Evil|[7)  should  be 
unaffected  by  changes  to  the  environment.  This  thought,  which  was  essential  to  our  argument 
in  §2.4,  can  be  stated  in  imprecise  slogan  form  as: 

ESP-gist  The  credence  one  should  assign  to  being  any  one  of  several  observers  having  identical 
experiences  is  independent  of  the  state  of  the  environment. 

In  more  precise  but  still  theory-independent  language,  this  epistemic  principle  becomes: 

ESP  Suppose  that  universe  U  contains  within  it  a  set  of  subsystems  S  such  that  every  agent  in 
an  internally  qualitatively  identical  state  to  agent  A  is  located  in  some  subsystem  which  is 
an  element  of  S.  The  probability  that  A  ought  to  assign  to  being  located  in  a  particular 
subsystem  X  £  S  given  that  they  are  in  U  is  identical  in  any  possible  universe  which  also 
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contains  subsystems  S  in  the  same  exact  states  (and  does  not  contain  any  copies  of  the 
agent  in  an  internally  qualitatively  identical  state  that  are  not  located  in  S). 

P(X\U)  =  P(X\S)  (14) 

ESP  stands  for  ‘Epistemic  Separability  Principle,’  as  the  principle  allows  one  to  separate  the 
relevant  parts  of  the  universe  from  the  rest;  that  is,  to  separate  the  set  of  subsystems  where 
the  agent  might,  for  all  they  know,  be  located  from  everything  else.  X ,  U,  and  S  are  not 
propositions;  (14)  is  shorthand  that  must  be  clarified.  P(X\U)  is  the  probability  that  A  assigns 
to  being  in  X  given  that  they  are  in  U.  In  P(X|<5>),  lS ’  is  shorthand  for:  there  exist  subsystems 
S  and  there  are  no  internally  qualitatively  identical  copies  of  A  outside  of  these  subsystems.  To 
be  precise,  let’s  say  that  one  is  ‘located  in’  subsystem  X  just  in  case  one  is  a  proper  part  of 
the  subsystem  (no  limb  or  brain  cell  is  omitted).  There  may  be  more  than  one  copy  of  A  in  a 
given  subsystem.  In  general,  There  will  be  many  ways  to  carve  out  a  set  of  subsystems  from 
the  universe.  ESP  applies  to  any  such  carving.  The  subsystems  in  S  together  need  not  cover 
the  entire  world,  there  may  well  be  parts  of  U  that  are  not  in  any  of  these  subsystems,  however 
these  omitted  parts  cannot  contain  copies  of  A.  The  principle  is  restricted  to  cases  where  S  has 
a  finite  number  of  members.  The  subsystems  need  not  be  located  at  the  same  time.  One  may, 
for  example,  be  unsure  if  they  are  the  person  waking  up  in  their  bed  on  Monday  or  Tuesday  (see 
§4-2). 

We  need  to  be  somewhat  careful  about  what  exactly  a  ‘subsystem’  is  supposed  to  be,  although 
we  will  not  attempt  to  give  a  rigorous  characterization  applicable  to  any  conceivable  theory.  The 
essential  idea  is  that  a  subsystem  is  a  part  of  the  larger  system  that  can  be  considered  as  a  physical 
system  in  its  own  right.  Slightly  more  formally,  we  imagine  that  the  overall  state  of  a  system  can 
be  decomposed  into  the  states  of  various  subsystems,  so  that  two  constraints  are  satisfied:  (1)  the 
state  of  each  subsystem,  perhaps  with  some  additional  information  about  how  the  subsystems 
are  connected,  can  be  used  to  uniquely  reconstruct  the  original  state;  and  (2)  the  information 
contained  within  each  subsystem’s  state  is  enough  to  specify  its  immediate  dynamical  evolution, 
as  long  as  the  other  subsystems  are  not  influencing  it.13  So,  in  classical  particle  physics,  a 
subsystem  might  be  a  collection  of  particles  at  a  time  and  its  state  specified  by  giving  the  masses, 
positions,  and  velocities  of  the  particles  in  the  collection.  By  contrast,  giving  only  the  positions 
(or  only  the  velocities)  of  a  collection  of  particles  would  not  specify  a  subsystem,  since  that 
wouldn’t  be  sufficient  to  determine  the  collection’s  evolution.  In  quantum  mechanics,  systems  can 
be  divided  into  subsystems  in  two  fundamentally  different  ways — quite  unlike  classical  physics. 
The  state  of  a  system  can  be  written  as  a  density  matrix  psys  corresponding  to  some  factor  of 
the  Hilbert  space  J$?Sys  (a  reduced  density  matrix  unless  the  system  is  the  whole  universe).  As 
Jtf’sys  can  itself  be  decomposed  into  factors,  Jfsys  =  ®  one  can  treat  the  reduced  density 

matrices  p\  and  p2  as  subsystems  of  psys .  Alternatively,  if — as  in  (13)-  branching  has  occurred, 

’’Work  would  need  to  be  done  to  formulate  these  constraints  precisely,  here  they  serve  as  rough  guides. 
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the  different  branches  of  psys  can  be  regarded  as  separate  subsystems.14 

Examination  of  the  argument  given  by  Elga  ([2004])  for  Indifference  reveals  that  something 
like  ESP  is  taken  for  granted.  In  his  Toss&Duplication  thought  experiment,  Elga  assumes 
that  the  outcome  of  an  additional  coin  toss  should  not  affect  the  credence  we  assign  to  being 
either  the  original  or  a  duplicated  person  with  identical  experiences;  the  justification  for  such 
an  assumption  would  have  to  be  something  like  ESP.  ESP  is  compatible  with  Indifference  in 
standard  cases  of  classical  self-locating  uncertainty  like  Duplicating  Dr.  Evil.  Requiring  that 
all  one  care  about  in  assigning  credences  between  Dr.  Evil  and  Dup  is  what’s  happening  in  the 
lunar  laboratory,  X ,  and  the  terrestrial  replica,  Y,  (together,  S  =  {X,  Y})  looks  like  it  allows 
any  assignment  of  credences  to  the  two  copies  at  all,  provided  one  is  consistent  across  universes 
that  vary  only  in  the  state  of  the  world  outside  the  two  laboratories.  Actually,  in  §4.1  we’ll 
see  that  although  ESP  is  compatible  with  Indifference,  the  rule  is  not  as  permissive  as  it  might 
initially  seem.  At  least,  not  if  we  strengthen  it  as  in  §4.1.  In  classical  cases,  the  strengthened 
principle  requires  one  to  consider  each  copy  of  oneself  to  be  equiprobable — in  agreement  with 
Indifference. 

Now  that  ESP  is  on  the  table,  let’s  apply  it  to  cases  of  measurement  in  Everettian  quantum 
mechanics.  Consider  Once-OR-Twice.  At  t2,  after  particle  a  has  been  measured,  Alice  has 
branched  into  multiple  copies.  Alice  knows  that  she  is  somewhere  in  the  Alice+Detector  system 
which  is  characterized  by  the  reduced  density  matrix  in  (13).  The  density  matrix  pad  itself  can 
be  divided  into  branches  X  &  Y  corresponding  to  the  particle  spin  being  measured  as  either  up 
or  down.  Any  universal  wave  function  which  agrees  on  the  density  matrix  pad  agrees  on  the 
state  of  X  and  Y.  So,  the  probability  of  being  in  X,  which  is  the  probability  of  an  up  result, 
must  be  the  same  in  any  universe  with  the  same  reduced  density  matrix  pad- 

More  generally,  suppose  that  an  experimenter  A  has  just  measured  observable  O  of  system  S 
and  the  measuring  device  has  recorded  some  eigenvalue  O,;  on  each  branch  of  the  wave  function.15 
As  discussed  in  §2.1,  the  reduced  density  matrix,  'pad,  will  be  diagonalized  in  the  pointer  basis  for 
the  Agent+Detector  subspace,  Ma  <8>  =^b,  by  the  decoherence  process.  Each  pointer  state  (with 
nonzero  amplitude)  defines  a  branch  of  the  reduced  density  matrix,  pad,  cm  which  the  detector 
D  registered  a  particular  outcome  O, .16  In  assigning  credences  to  the  different  outcomes,  the 
agent  is  assigning  probabilities  to  being  located  in  these  different  branches.  Specifying  the  state 
of  pad  determines  the  state  of  all  its  branches  and  thus  of  all  the  subsystems  in  which  the  agent 
might  find  themselves  ( S ). 

Thus  in  ordinary  cases  of  quantum  measurement,  we  can  formulate  a  less  general  version  of 
ESP  which  will  be  sufficient  for  our  derivation  of  the  Born  rule. 

14In  specifying  the  state  of  each  branch  of  a  reduced  density  matrix  like  (13)  one  must  retain  the  numerical 
prefactors  before  each  term  as  they  would  be  necessary  to  reconstruct  the  state  of  the  system  as  a  whole. 

15 ESP  is  not  only  applicable  to  Everettian  quantum  mechanics  in  cases  of  measurement.  It  can  be  applied 
whenever  one  is  trying  to  locate  oneself  within  a  collection  of  decohered  branches  of  the  wave  function.  For 
example,  this  decoherence  might  have  been  caused  by  chaotic  processes  instead  of  quantum  measurement  (Wallace, 
[2012],  ch.  3). 

16After  a  measurement,  the  branching  structure  of  pad — the  set  of  pointer  states — can  be  derived  either  by 
examining  the  interaction  with  the  environment  or  by  seeing  in  which  basis  the  matrix  is  diagonal.  One  need  not 
know  anything  about  the  state  of  the  environment;  the  reduced  density  matrix  alone  is  sufficient. 
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ESP-QM  Suppose  that  an  experiment  has  just  measured  observable  O  of  system  S  and  regis¬ 
tered  some  eigenvalue  Oi  on  each  branch  of  the  wave  function.  The  probability  that  agent 
A  ought  to  assign  to  the  detector  D  having  registered  Oi  when  the  universal  wave  function 
is  dt,  P(Oi |\Ef),  only  depends  on  the  reduced  density  matrix  of  A  and  D ,  Pad'- 

P(Oi\^)  =  P(Oi\pAD)  (15) 

This  principle  tells  us  that  when  observers  assign  probabilities  to  recorded  outcomes  of  measure¬ 
ments  that  have  already  occurred,  these  probabilities  should  only  depend  on  the  Agent -|-Detector 
state,  pad  (not  on  other  features  of  the  universal  wave  function).  By  applying  this  principle  to 
quantum  cases  instead  of  Indifference,  we  are  now  able  to  shake  the  unrealistic  assumption  that 
the  number  of  branches  in  which  a  certain  outcome  occurs  is  well-defined. 

In  formulating  ESP-QM,  we’ve  relied  on  the  fact  that  the  state  of  a  quantum  subsystem  is 
specified  by  a  reduced  density  matrix.  Although  this  is  the  standard  way  of  representing  sub¬ 
systems  in  quantum  mechanics,  one  might  worry  that  its  use  here  requires  further  justification. 
We  discuss  this  concern  in  appendix  B. 

3.2  Deriving  the  Born  Rule 

In  this  section  we  will  derive  the  Born  rule  probabilities  as  the  rational  assignment  of  credences 
post-measurement  pre-observation.  We  will  first  derive  the  rule  in  a  case  with  two  branches  that 
have  equal  amplitudes,  then  use  similar  techniques  to  treat  a  case  with  two  branches  of  unequal 
amplitude.  It  is  straightforward  to  extend  these  methods  to  more  general  cases  (see  appendix 
C).  Mathematically,  our  argument  is  most  similar  to  that  of  Zurek  ([2005])  and  not  far  from 
those  of  the  decision  theoretic  approach  (Deutsch,  [1999];  Wallace,  [2003b],  [2010c],  [2012]).  The 
interest  of  this  proof  is  not  its  mathematical  ingenuity  but  the  facts  that  (a)  it  applies  to  cases 
where  uncertainty  is  undeniably  present  and  (b)  it  is  based  on  a  single  well-motivated  principle 
of  rationality,  ESP. 

Proof  of  the  Born  rule  for  \/\  case:  Alice  measures  the  2-spin  of  a  single  particle  in  the 
x-spin  up  state.  One  display  (D 1)  will  show  the  result  of  the  experiment.  If  the  spin  is  up,  a 
second  display  (D 2)  will  show  O.  If  it  is  down,  a  <0  will  appear  on  the  second  display.  Alice  is  not 
immediately  affected  by  the  result;  in  particular,  she  is  for  a  time  unaware  of  the  experiment’s 
outcome.1'  The  wave  function  of  Alice,  the  detectors,  the  particle,  and  the  environment  (the 
rest  of  the  universe)  evolves  from 

l*0>  =  \Ro)a  I R)di  \R)d2  It.)  \Er)  (16) 


17That  is,  we  assume  that  Alice  is  in  the  same  physical  state  on  both  branches  of  the  wave  function  (|-R)^  in 
(17)).  As  time  passes  and  the  result  of  the  measurement  has  effects  on  various  parts  of  the  universe,  Alice’s  state 
will  come  to  differ  on  the  two  branches  (e.g.,  if  she  observes  the  result).  But,  immediately  after  measurement  her 
state  will  be  unaffected. 
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to 

l*i)  =  ^  I R)a  lt)D1 \V)d2  Itz)  |^)  +  ^  I R)a  \i)m  \0)D2 14-*)  |^o>  •  (17) 

To  use  ESP-QM  to  demonstrate  that  P  (t|\Eri)  =  P  (44  *i)  =  1/2,  we  will  need  to  also  consider 
an  alternate  scenario  where  the  computer  (part  of  the  environment)  is  programmed  differently 
so  that  *s?  displays  if  down  is  measured  and  <>  displays  if  up.  Then  the  post-measurement 
pre-observation  wave  function  would  be 

1*2}  =  | R)a  |t)D1  10)^2  It*)  l^o)  +  I R)a  I4)d1  1^2  Itz)  |^)  •  (18) 

Step  1:  Focus  first  on  Alice  and  D 1.  The  Alice+Detector  1  reduced  density  matrices  for  H/i 
and  T2  are  the  same,18 

PADl  (*l)  —  PAD!  (*2)  =  ^  \R)A\t)m  (P\a  (tlci  +  2  I^)a  lt)_Di  (-^Ia  (tlui  ■  (19) 

ESP-QM  requires  that  the  probabilities  Alice  assigns  to  the  possible  spin  results  be  the  same  in 
these  two  universes  as  they  have  the  same  Observer -|-Detector  reduced  density  matrix, 

P(l\*1)=P(l\V2)  .  (20) 

Step  2  :  If  we  ask  what  probability  Alice  should  assign  to  the  display  being  O,  we  need  to 
consider  the  reduced  density  matrix  generated  by  tracing  over  D 1,  the  spin  of  the  particle,  and 
the  environment.  T,  and  vp2  agree  on  pad2-  By  ESP-QM,  the  probabilities  assigned  to  O  must 
be  equal, 

p(v\*1)  =  p(y\*2)  ■  (21) 

Step  3  :  Next,  note  that  the  O-branches  just  are  the  f-branches  in  Ti  and  the  0-branclies  just 
are  the  4,-branches  in  T2.  Thus  Alice  is  in  the  O-branch  of  pad2(*i)  if  and  only  if  she  is  in  the 
t-branch  of  /3adi(* i)-  Similarly,  she  is  in  the  O-branch  of  pad2(*2)  if  and  only  if  she  is  in  the 
^-branch  of  pad i(*2)-  Therefore,  Alice  must  assign 

P  (tl*i)  =  P  (0|'Ef1) 

P(4.|tf2)  =  P(0|tf2)  .  (22) 

Step  4:  Putting  together  the  results  in  (20)  -  (22),  we  see  that  in  'Pi  the  probability  of  being  on 
a  t/O-branch  must  be  the  same  as  that  for  being  on  a  4./<4>-branch:  P  (t|*i)  =  P  (4-|*i)-  So,  the 
unique  rational  degrees  of  belief  in  the  first  scenario  consider  each  branch  to  be  equiprobable. 
Since  these  are  the  only  two  alternatives,  the  probability  of  each  outcome  is  one  half.  In  fact, 
using  ESP-QM,  we  have  shown  that  for  any  case  in  which  the  reduced  density  matrix  is  as  in 
(19),  the  two  spin  states  are  equiprobable;  there  doesn’t  have  to  be  a  second  display  present. 


18Here  we  assume  for  simplicity  that  the  environments  are  perfectly  orthogonal.  What  is  crucial  is  that  we 
choose  ^2  such  that  'padi  (^l)  —  PADl  (^2)  and  PAD2  (^1)  =  PAD2  (^2)5  both  of  which  can  be  easily  satisfied 
even  if  the  environment  states  are  not  perfectly  orthogonal. 
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The  result  applies  to  a  state  in  the  general  form: 


^2  I^M  lt)m 

Proof  of  the  Born  rule  for  |/| 


lt*>  lEr)  +  ±  I R)a  U)D1  llz)  I Ei)  (23) 

case:  Suppose  Alice  measures  a  particle  in  the  state 


(24) 


in  which  case  upon  measurement  padi  would  be 

Padi  (^2)  =  g  \R)a  It) x>i  (E\a  (tlm  +  g  \R)a  l-Om  (E\a  (-Udi  • 

To  determine  the  probabilities  in  this  scenario,  we  will  consider  two  different  ways  of  having 
three  displays  linked  to  the  measurement  outcomes  in  the  post-measurement  pre-observation 
state, 


I'M  =  (  I R)a  lt>m  \0)D2  I *)D3  It.)  \Eal)  +  | R)a  ft)m  P)D2  |*>D3  It.)  \Ea2) 

+  \P)  A  1 4-) D1  l^*) D2  1^) D3  14-z)  IM3)  ^ 

I'M  =  yi (  I R)a  |t)D1  \V)D2  |*)B3  It*)  \Epi)  +  | R)a  lt)m  \V)D2  \*)m  It.)  I E02) 

+  |-^) A  14') D1  1^) D2  1^) D3  14-*)  \Ef33)  ^  (26) 

Step  1:  Using  ESP-QM  to  ignore  D 2,  D 3,  and  the  environment,  we  can  focus  on  the  first 
display  and  compare  the  probabilities  for  J,, 


pma)  =  pm0)  ■ 


Step  2:  Focusing  on  the  second  display  gives 


(27) 


P(Opa)  =P(0P8)  .  (28) 

Since  the  <)-branches  of  4^  are  the  ^-branches,  we  have 

p(0|'M  =  PUI'M  •  (M 

Step  3:  Similarly,  focusing  on  D 3  and  noting  that  the  ^.-branches  of  4/^  are  the  ^-branches 
yields 

p  (♦!*«)  =  p  41*0)  •  (30) 

Step  4:  Combining  (27)  -  (30)  gives 

P  (0|M)  =  P  (*|*a)  =  P  (IIM)  ■  (31) 
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Since  these  are  all  of  the  possibilities,  the  probability  of  each  is  a  third  and 

P{l\*a)  =  l 

^(tl®a)  =  |.  (32) 

This  result  holds  whenever  the  reduced  density  matrix  is  as  in  (25). 

The  logic  of  this  section  suggests  a  way  of  thinking  about  the  Born  rule  at  an  intuitive  level. 
Our  recipe  amounts  to  the  following  prescription:  write  the  state  vector  as  a  sum  of  orthogonal 
vectors  with  equal  amplitudes  by  unitarily  transforming  the  environment.  Then  (and  only  then), 
ESP  justifies  according  equal  credence  to  each  such  basis  vector.  In  that  sense,  the  Born  rule  is 
simply  a  matter  of  counting.  However,  we  don’t  need  to  take  this  picture  literally. 

Now  that  ESP-QM  has  revealed  its  power,  one  might  reasonably  suspect  that  we  have  given 
it  too  much.  Let’s  take  a  moment  to  reflect  on  the  principle’s  plausibility.  As  was  discussed  in 
§2.4  and  3.1,  if  ESP-QM  is  correct  then  Alice’s  credence  that  the  result  was  up  should  not  change 
between  t2  and  £3  in  Once-Or-Twice  even  though  the  number  of  copies  of  Alice  changes.  The 
proof  for  the  |/|  case  above  relies  on  the  same  trick:  the  reduced  density  matrix  in  (25)  could 
describe  a  case  of  three  detectors  with  three  different  combinations  of  outputs — as  in  (26) — or  a 
single  detector  with  two  different  outputs.  By  ESP-QM  the  probability  of  an  up  result  must  be 
the  same  in  either  case.  When  ESP  was  introduced  it  was  immediately  restricted  to  only  apply 
in  cases  where  the  changes  made  to  the  environment  do  not  involve  the  creation  of  additional 
copies  of  the  agent  elsewhere.  Why?  We  cannot  claim  that  the  problem  is  that  such  changes 
increase  the  number  of  copies  in  existence — the  change  from  t2  to  t3  in  Once-Or-Twice  does 
too.  One  problem  with  omitting  the  restriction  is  that  it  would  be  impossible  to  assign  non-zero 
credences  to  the  additional  copies  in  the  environment,  since  the  agent  would  have  to  assign  the 
same  probabilities  to  all  of  the  original  copies  whether  or  not  additional  copies  are  present.  This 
problem  does  not  arise  when  ESP-QM  is  applied  to  the  quantum  case.  The  change  from  t2  to  t3 
in  Once-Or-Twice  increases  the  number  of  copies  of  Alice  in  existence,  but  she  is  not  forced  to 
assign  any  of  the  copies  probability  zero.  The  fact  that  ESP-QM  avoids  this  particular  problem 
does  not  fully  exonerate  the  principle.  The  change  in  the  quantum  state  from  t2  to  t3  could 
conceivably  turn  out  to  be  relevant.  The  motivation  for  ESP-QM ,  emphasized  earlier,  is  that 
when  Alice  is  wondering  about  her  relation  to  some  detector  D ,  things  happening  elsewhere  are 
in  fact  not  relevant.  We  believe  the  principle  to  be  well- motivated  but  not  established  beyond 
any  doubt,  and  thus  our  derivation  is  provisional:  if  one  can  focus  on  the  reduced  density  matrix 
in  calculating  probabilities  as  recommended  by  ESP-QM ,  the  Born  rule  follows. 

4  Varieties  of  Uncertainty 

4.1  ESP  and  Indifference 

In  classical  physics,  you  experience  self-locating  uncertainty  when,  for  example,  the  universe 
is  so  large  that  you  should  expect  there  to  exist  a  distant  planet  where  someone  is  having 
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the  exact  same  immediate  experiences  that  you  are,  or  when  the  universe  survives  so  long 
that  you  should  expect  short-lived  Boltzmann  brains  to  pop  out  of  the  vacuum  in  the  exact 
subjective  state  you  are  in  now,  or  when,  as  in  Duplicating  Dr.  Evil,  you  have  reason  to 
think  someone  has  purposefully  created  a  duplicate  of  you.  Such  phenomena  can  also  occur 
in  quantum  mechanical  contexts,  leading  to  within-branch  uncertainty.  Once  the  principle  is 
strengthened,  ESP  mandates  that  this  sort  of  uncertainty  should  be  treated  with  Indifference. 
This  brings  out  an  important  virtue  of  the  epistemic  principle  we’ve  proposed:  ESP  explains 
why  Indifference  was  a  good  heuristic  valid  in  a  wide  variety  of  cases  and  also  explains  why  the 
uncertainty  arising  from  quantum  measurements  should  be  treated  differently,  using  the  Born 
rule.19 

To  apply  ESP  fruitfully  to  classical  cases,  it  will  help  to  strengthen  the  principle.  The  stronger 
version  of  ESP  is  not  concise  when  stated  precisely  (below),  but  the  basic  idea  is  simple:  Not 
only  should  it  be  irrelevant  what’s  going  on  outside  of  the  subsystems  in  S ,  it  also  should  be 
irrelevant  where  in  spacetime  each  subsystem  in  S  is  located.  Consider  again  Duplicating  Dr. 
Evil.  It  seems  irrelevant  where  the  philosophers  decide  to  build  the  replica  of  the  lab.  They 
could  make  it  in  America  or  Japan  or  on  Mars.  The  choice  shouldn’t  affect  the  probability  that 
Dr.  Evil  assigns  to  being  on  the  moon.  Perhaps  more  controversially,  we  believe  it  is  irrelevant 
when  they  build  the  replica.  Suppose  they  tell  Dr.  Evil  that  they  are  scanning  his  lab  now  and 
will  (unstoppably)  make  the  replica  next  week.  In  such  a  case,  Dr.  Evil  should  start  to  wonder 
whether  he’s  mistaken  about  the  date.  Further,  we  believe  his  doubts  about  being  on  the  moon 
should  not  be  mitigated  by  the  temporal  separation  between  himself  and  the  replica. 

Strong  ESP  Suppose  that  universe  U  contains  within  it  a  set  of  subsystems  S  such  that  every 
agent  in  an  internally  qualitatively  identical  state  to  agent  A  is  located  in  some  subsystem 
which  is  an  element  of  S  =  {X,  Y, ...}.  Let  S'  =  {Mx(X),  My(Y), ...}  where  each  M(-) 
is  a  transformation  which  rotates,  spatially  translates,  and/or  temporally  shifts20  the  sub¬ 
system.  The  probability  that  A  ought  to  assign  to  being  located  in  a  particular  subsystem 
X  £  S  given  that  they  are  in  U  is  identical  to  the  probability  that  they  ought  to  assign  to 
being  in  Mx(X)  £  S'  given  that  they  are  in  some  universe  U'  (which  contains  within  it  a 
set  of  subsystems  S'  such  that  every  agent  in  an  internally  qualitatively  identical  state  to 
agent  A  is  located  in  some  subsystem  which  is  an  element  of  S'). 

P(X\U)  =  P(Mx(X)\U')  (33) 

Here  Mx(X)  is  a  relocated  version  of  X ,  My(Y)  is  a  relocated  version  of  Y .  etc.  Since  My(Y) 
is  simply  a  relocated  version  of  subsystem  Y .  being  in  My(Y)  will  feel  just  like  being  in  Y.  One 


19In  a  recent  paper,  (Wilson,  [forthcoming])  has  offered  a  clever  alternative  explanation  as  to  why  within-branch 
uncertainty  should  be  treated  differently  from  which-branch  uncertainty:  different  Everettian  worlds  are  in  fact 
different  possible  worlds  and  thus  Indifference  does  not  constrain  one’s  treatment  of  which-branch  uncertainty. 
Our  analysis  goes  further  than  just  explaining  the  difference  between  the  two  kinds  of  uncertainty,  we  also  explain 
the  commonality  (by  giving  a  unified  account  of  how  both  the  Born  rule,  quantifying  which-branch  uncertainty, 
and  Indifference ,  quantifying  within-branch  uncertainty,  arise  from  a  single  core  epistemic  principle,  ESP). 

20  Strong  ESP  loses  some  of  its  plausibility  if  these  three  transformations  are  not  symmetries  of  the  dynamical 
laws  of  U  and  U' .  Here  we  assume  that  they  all  are. 
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might  worry  that  the  fact  that  My(Y)  is  surrounded  by  a  different  local  environment  than  that 
around  Y  could  make  My(Y)  and  Y  distinguishable.  If  the  replica  of  the  laboratory  is  built 
floating  in  outer  space  instead  of  sitting  on  Earth’s  surface,  the  instantaneous  arrangement  of 
furniture  and  fermions  may  be  the  same,  but  the  copy  of  Dr.  Evil  would  quickly  notice  the 
absence  of  a  force  keeping  his  feet  on  the  floor.  Still,  if  we  focus  on  the  instant 21  when  the  outer 
space  replica  really  is  a  perfectly  shifted  version  of  the  terrestrial  replica,  all  of  the  particles  in 
the  each  agent’s  brain  are  in  the  same  arrangement  and  the  outer  space  copy  must  be  having 
exactly  the  same  experiences  as  those  had  by  the  terrestrial  copy  (and  thus  the  same  experiences 
as  those  of  the  original  Dr.  Evil). 

Using  Strong  ESP,  it  is  straightforward  to  prove  that  one  must  follow  the  recommendations 
of  Indifference  in  cases  of  classical  duplication.  In  fact,  the  proof  is  so  quick  that  it  may  cause 
you  to  doubt  the  principle.  It  shouldn’t.  Indifference  was  intuitively  plausible.22  Strong  ESP 
retains  those  intuitive  recommendations  while  avoiding  the  unacceptable  recommendations  in 
cases  like  Once-or-Twice  (§2.4). 

We’ll  first  apply  Strong  ESP  to  Duplicating  Dr.  Evil  and  then  move  to  the  general  case. 
Let  A'  be  the  lunar  laboratory  and  all  of  its  contents  (including  Dr.  Evil)  and  Y  be  the  terrestrial 
replica  and  its  contents.  Let  Mx(X)  move  the  lunar  laboratory  to  where  the  terrestrial  replica 
was  and  My(Y)  move  the  lunar  laboratory  to  where  the  terrestrial  replica  was  ( Mx(X )  =  Y  & 
My(Y)  =  X,  provided  the  two  laboratories  are  in  truly  identical  physical  states).  Take  U  to  be 
the  original  world  and  U'  to  be  the  universe  you  get  by  switching  the  two  laboratories  (which, 
as  it  happens,  doesn’t  change  anything:  U'  =  U ).  Then,  (33)  yields 

P(X\U)  =  P{Mx{X)\U')  =  P(Y\U )  .  (34) 

Generalizing  to  arbitrary  many  copies  of  an  agent  is  simple.  By  pairwise  swaps  like  the  one  above 
it  can  be  shown  that  any  two  copies  are  equiprobable.  Thus  when  we  restrict  attention  to  cases 
where  the  agent’s  copies  are  physically  identical  and  not  merely  having  identical  experiences,  the 
recommendations  of  Strong  ESP  align  with  those  of  Indifference.  The  recommendations  even 
agree  in  cases  of  quantum  measurement  when  the  agent’s  copies  are  truly  identical,  as  in  equal 
amplitude  superpositions  like  (17). 

The  above  justification  for  assigning  equal  credence  to  being  in  the  lunar  and  terrestrial 
laboratories  in  the  Duplicating  Dr.  Evil  case  at  no  point  relied  on  the  laws  being  classical. 
One  could  equally  well  apply  the  reasoning  to  cases  of  within-branch  uncertainty.  For  simplicity, 
focus  on  a  single  branch  which  contains  a  copy  of  Dr.  Evil  on  the  moon  and  another  on  Earth. 
The  state  might  then  be  represented  as  |\1/)  =  |L)|T)|.E),  where  | L)  is  the  lunar  lab,  | T)  is 


21  If  one  thinks  (reasonably  enough)  that  it  takes  time  to  have  an  experience,  the  subsystems  X ,  Y,  ...  must 
be  taken  to  be  temporally  extended  if  they  are  to  contain  agents  having  experiences.  For  example,  let  Y  be  the 
terrestrial  laboratory  persisting  over  the  course  of,  say,  one  minute.  Moving  Y  into  outer  space  wouldn’t  change 
what’s  happening  inside  the  laboratory  during  that  minute  at  all.  It  would,  however,  involve  a  violation  of  the 
laws  of  nature  (as  the  felt  downward  force  would  have  no  source)  unless  other  appropriate  changes  were  made 
(like  the  addition  of  an  appropriately  sized  planet  beneath  the  laboratory’s  floor). 

22In  fact,  it’s  not  just  intuitive,  there  are  arguments  for  it.  The  argument  given  by  Elga  ([2004])  essentially 
relies  on  something  like  ESP,  as  we  discussed  in  Section  3.1. 
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the  terrestrial  lab,  and  | E)  is  everything  else.  Take  subsystem  X  to  be  the  lunar  laboratory, 
represented  by  the  reduced  density  matrix  \L)(L\,  and  Y  to  be  terrestrial  one,  |T)(T|.  We  can 
define  two  unitary  shift  operators:  Sx  which  takes  |  L)  to  | T)  and  Sy  which  takes  | T)  to  |  L). 
With  these  operators  we  can  express  Mx  and  My  mathematically  as  M x{-)  =  Sx(-)Sx  and 
My(-)  =  Sy(-)Sy.  The  remainder  of  the  argument  proceeds  as  above.  Note  that  the  division 
into  subsystems  is  different  here  than  in  §3.2.  There  we  assumed  you  know  what  particles  you 
are  made  of  and  are  trying  to  determine  what  branch  of  a  particular  reduced  density  matrix 
you  inhabit.  Here  we’ve  focused  on  the  question  of  what  you’re  made  of — that  is,  which  reduced 
density  matrix  gives  the  state  of  the  particles  that  compose  your  body. 

4.2  Mixed  Uncertainties 

We  have  now  seen  that,  depending  on  the  scenario,  probability  in  quantum  mechanical  contexts 
is  sometimes  handled  by  the  Born  rule  and  other  times  handled  by  Indifference.  What  about 
cases  where  the  two  types  of  uncertainty  are  mixed — when  one  knows  neither  where  they  are 
in  spacetime  nor  which  branch  they  are  on?  Here  we  must  rely  on  the  rule  from  which  both 
individual  prescriptions  arise:  Strong  ESP.  In  this  section,  we’ll  discuss  two  illustrative  ‘quan¬ 
tum  sleeping  beauty’  scenarios  that  combine  both  kinds  of  uncertainty  (Lewis,  [2007],  [2009b]; 
Papineau  &  Dura-Vila,  [2009a],  [2009b];  Peterson,  [2011];  Bradley,  [2011],  [forthcoming];  Wilson, 
[2014];  Groisman  et  al.  ,  [2013]). 

Two-Branch-Beauty  The  experimental  subject,  Beauty,  will  be  put  to  sleep  on  Sunday  night. 
While  she  is  asleep,  there  will  be  a  z-spin  measurement  of  particle  a  which  is  initially  x-spin 
up.  If  the  result  is  up,  she  will  be  awoken  on  Monday  and  then  her  memory  of  Monday’s 
events  will  be  erased  so  that  on  Tuesday  when  she  wakes  up  her  last  memories  will  be  of 
going  to  sleep  Sunday  night.  If  the  result  is  down,  she  will  not  have  her  memory  erased. 
Beauty  knows  everything  that  might  be  relevant  about  the  setup. 


Figure  2:  Two  quantum  sleeping  beauty  scenarios. 
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Consider  Beauty’s  situation  upon  waking  with  her  most  recent  memories  being  those  of 
going  to  bed  Sunday  evening.  Her  evidence  doesn’t  discern  between  three  possible  locations  in 
the  multiverse  where  she  might  be:  Monday  morning  on  a  branch  where  a  was  measured  to  be 
up,  Mf,  Tuesday  morning  on  a  branch  where  a  was  up,  Tf,  or  Monday  morning  on  a  branch 
where  a  was  down,  Mf.  Upon  waking,  what  probability  should  Beauty  assign  to  particle  a  being 
up?  That  is,  what  probability  should  she  assign  to  Mf  V  Tf!  By  .ESP-based  arguments  like 
those  in  §3.2,  she  should  assign  equal  probability  to  Mf  and  Mf, 

P(Mt)=P(Ml).  (35) 

However,  since  M f  and  M j,  are  not  the  only  alternatives,  it  does  not  follow  that  she  should  assign 
a  probability  of  one  half  to  each. 

Given  that  a  was  spin  up,  we’re  dealing  with  a  case  of  within-branch  uncertainty  about  what 
day  it  is.  Here  Indifference  and  Strong  ESP  agree  that  Alice  should  assign  equal  probability 
to  it  being  Monday  or  Tuesday  morning  (provided  that  the  Monday  and  Tuesday  Alices  are  in 
identical  physical  states,  a  useful  albeit  unrealistic  assumption), 

P  (Mt)  =  P  (Tf)  .  (36) 

Since  Mf,  Mf,  and  Tf  are  the  only  three  options,  it  follows  from  (35)  and  (36)  that  Alice  should 
assign  each  alternative  a  probability  of  one  third.  The  answer  to  our  original  question  is  that 
P(t)  =  P  (Mf  V  Tf)  ought  to  be  two  thirds.  We  thus  recover  the  popular  ‘thirder’  result  for 
this  kind  of  sleeping  beauty  scenario.23 

Two-Branch-Beauty  may  appear  to  be  merely  a  fanciful  philosopher’s  concoction,  of  little 
to  no  physical  importance.  In  fact,  the  case  is  importantly  similar  to  inflation  scenarios  where 
some  quantum  branches  have  no  life,  others  have  life,  and  some  have  so  much  life  that  one  can 
reasonably  expect  that  one’s  own  experiences  are  duplicated  somewhere  else  in  spacetime.  If  we 
hope  to  develop  a  framework  for  testing  theories  like  this,  we  need  epistemic  principles  which 
can  handle  uncertainty  about  both  which  branch  one  is  on  and  where  one  is  within  that  branch. 

There’s  something  odd  about  the  recommendations  of  Strong  ESP  in  Two-Branch-Beauty. 
The  probability  that  the  principle  tells  Beauty  to  assign  to  up  is  2/3  whereas  the  Born  rule  rec¬ 
ommends  a  probability  of  1/2.  Is  the  Born  rule  invalid?24  We  think  that  the  right  lesson  to 
draw  is  that  there  are  two  sources  of  uncertainty  here  and  the  Born  rule  is  perfectly  correct  for 
quantifying  the  uncertainty  brought  about  by  quantum  measurement.  However,  there  is  also 
uncertainty  arising  from  the  duplication  of  Beauty’s  experiences  on  the  up  branch.  When  both 
types  of  uncertainty  are  present,  we  need  a  rule  for  aggregating  them.  Strong  ESP  is  capable  of 
doing  this,  as  discussed  here  and  in  §4.3. 


23Vaidman  ([2011],  §6)  gets  the  same  result  in  this  case  through  his  consideration  of  self-locating  uncertainty. 
24See  the  discussion  of  cosmological  cases  where  it  appears  that  the  Born  rule  is  untenable  in  (Page,  [2009a]); 
see  also  §4.3. 
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To  understand  what’s  special  about  the  Two-Branch-Beauty  case  discussed  above,  it  is 
worth  considering  a  variant  of  the  case. 

Three-Branch-Beauty  Beauty  will  be  put  to  sleep  on  Sunday  night.  While  she  is  asleep, 
there  will  be  a  z-spin  measurement  of  particle  a  which  is  initially  ar-spin  up.  If  the  result  is 
up,  particle  b  (identically  prepared)  will  also  be  measured:  If  b  is  up,  Beauty  will  awake  on 
Monday  morning;  if  down,  Beauty  will  be  kept  asleep  until  Tuesday  morning.  If  particle  a 
is  measured  to  be  down,  particle  b  will  not  be  measured  and  Beauty  will  awake  on  Monday. 
Beauty  knows  the  setup. 

This  is  simply  a  case  of  repeated  quantum  measurements,  essentially  the  same  setup  as  Once-OR- 
Twice.  Here  we  can  run  arguments  like  those  in  §3.2  to  show  that  Beauty  should  assign  Born  rule 
probabilities:  P  (Mf)  =  1/4,  P  (T-j.)  =  1/4,  and  P  (MjJ  =  1/2. 25  Comparing  Three-Branch- 
Beauty  and  Two-Branch-Beauty,  we  see  that  Strong  ESP  explains  how  the  popular  thirder 
solution  can  be  correct  for  Two-Branch-Beauty  whereas  the  halfer  solution  is  correct  in  the 
quantum  Three-Branch-Beauty  case  (solving  the  mystery  of  Lewis,  [2007]). 


4.3  Large  Universe  Cosmology  and  the  Quantum  Multiverse 

The  tricks  of  §4.2  can  be  used  to  give  recommendations  in  the  general  case.  Consider  a  universe 
U — recall  that  in  our  terminology  ‘universe’  means  ‘entire  quantum  multiverse’ — with  a  set  of 
observers  {Oi}  who  find  themselves  in  indistinguishable  circumstances.26  They  may  be  on  the 
same  branch  of  the  wave  function,  on  different  branches,  or  even  at  different  times.  For  each 
observer  Oi  existing  at  some  time  tj,  the  overall  state  describing  the  universe  can  be  written  in 
the  form 

I ’Uh))  =  £*»]&)  +  0i\ 4>i)  ,  (37) 

where  |</>j)  is  the  branch  on  which  the  observer  lives  and  |  </>/-)  is  the  remainder  of  the  quantum 
state,  including  all  other  branches.  \(j>S)  is  (at  least  approximately)  orthogonal  to  | (j>i). 

In  this  case,  by  the  considerations  above,  Strong  ESP  provides  an  unambiguous  procedure 
for  assigning  credences.  To  each  observer  we  assign  a  weight 

Wi  =  \ai\2  .  (38) 

Then  the  probability  for  being  observer  Oi  is  simply2' 

P{Oi\U)  =  =^-  .  (39) 

Ei  wj 


25  A  wrinkle:  Before  we  can  apply  the  methods  of  §3.2,  one  must  first  shift  attention  to  a  universe  where  the 
T-f  copy  of  Beauty  is  temporally  translated  back  to  Monday.  Strong  ESP  says  that  the  probabilities  will  be  the 
same  in  such  a  universe.  However,  this  new  case  is  easier  to  handle  as  it  is  exactly  analogous  to  Once-OR-Twice. 

26  Here  we  work  under  the  fiction  that  the  observers  can  be  easily  distinguished,  labeled,  and  counted.  In 
actuality  there  may  be  multiple  ways  of  carving  the  wave  function  into  branches  (see  §2.2).  Corresponding  to 
different  ways  of  carving  the  wave  function  into  branches,  there  will  be  different  collections  of  observers  Oi. 
However,  predictions  derived  using  (39)  will  not  depend  significantly  on  the  particular  carving.  In  applying 
Strong  ESP  we  also  assume  that  the  observers  are  physically  identical  (modulo  the  weights  of  their  branches). 

27Groisman  et  al.  ([2013])  have  proposed  essentially  the  same  rule. 
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Note  that  the  weights  {w^}  will  not  in  general  sum  to  unity  for  a  variety  of  reasons:  the  weights 
may  be  calculated  at  different  times  for  different  observers;  there  may  be  multiple  observers  on 
a  single  branch;  there  may  be  branches  with  no  observers.  This  rule  reduces  to  Indifference  in 
the  case  of  multiple  observers  on  a  single  branch  and  to  the  Born  rule  when  there  is  a  single 
observer  on  each  branch. 

This  recipe  provides  a  resolution  of  the  ambiguity  in  applying  the  Born  rule  in  large  universes 
that  was  identified  by  Page  ([2009a],  [2009b],  [2010]);  see  also  (Aguirre  &  Tegmark,  [2011];  Al¬ 
brecht  &  Phillips,  [2014]).  Consider  a  universe  that  is  large  enough  to  contain  multiple  observers 
with  identical  experiences  on  the  same  branch  of  the  wave  function.  Imagine  that  each  ob¬ 
server  plans  to  measure  the  0-spin  of  their  particle,  each  particle  being  in  a  potentially  different 
pre-measurement  state: 

k»)  =7*1  tz)  +  <5i|  I*)  •  (40) 

In  simple  cases  of  quantum  measurement,  the  Born  rule  can  be  expressed  as  the  statement  that 
the  probability  of  an  observational  outcome  is  given  by  the  expectation  value  of  a  projection 
operator.  For  example,  in  an  individual  state  of  the  form  (40),  the  probability  of  observing 
spin-up  is  P(tzki)  =  (<7,; |II-|- Icq)  =  |7i|2,  where  II7  =  |tz)(tz|-  Page  shows  that  there  is  no 
projection  operator  that  gives  the  probability  that  an  observer,  not  knowing  which  observer 
they  are,  will  measure  fz  or  j,z;  in  that  sense  the  Born  rule  is  insufficient  to  fix  the  probabilities 
of  measurement  outcomes.  Strong  ESP  resolves  this  ambiguity;  using  (39),  the  probability  of 
0-spin  up  (post-measurement,  pre-observation)  is  simply  given  by 

p(uu)  =  Y,p(°i\u)x  w2  >  (41) 

i 

and  analogously  for  0-spin  down.  Here  P(Oi\U)  are  the  probabilities  assigned  to  being  different 
observers  before  the  measurement  is  conducted. 

This  result  has  consequences  for  the  measure  problem  in  cosmology  (Freivogel,  [2011];  Salem, 
[2012];  Vilenkin,  [2012]),  although  we  will  not  explore  them  in  detail  here.  We  will  only  note 
that  the  probability  one  should  assign  to  being  a  particular  observer  in  the  multiverse  clearly 
depends  on  the  amplitude  of  the  branch  on  which  that  observer  finds  themselves.  For  example, 
consider  one  evolving  branch  on  which  the  temporal  density  of  observers  grows  exponentially, 
nft)  oc  ewt,  such  as  might  happen  in  inflationary  cosmology.  Let  B  be  the  subset  of  observers 
who  live  on  that  branch.  According  to  our  prescription,  the  probability  one  should  assign  to 
being  on  that  branch  is  not  proportional  to  the  integral  of  nft )  over  time,  which  is  obviously 
infinite.  Rather,  it  should  be  weighted  by  the  amplitude  aft)  of  the  corresponding  branch: 

P(B\U)  oc  J  dt\a(t)\2n(t).  (42) 

If  the  the  amplitude  is  decaying  exponentially  in  time,  aft )  oc  e~Jt,  we  will  have  a  well-defined 
finite  probability  for  being  a  member  of  B  as  long  as  7  >  w/2.  If  7  <  w/2,  the  numerator  and 
denominator  of  (39)  both  go  to  infinity  and  the  rule  fails  to  give  a  well-defined  probability  for 
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being  in  B. 


5  Probability  in  Practice 

In  §3.2  we  showed  that,  post-measurement  pre-observation,  agents  should  assign  probabilities 
to  measurement  outcomes  in  accordance  with  the  Born  rule.  This  alone  is  not  a  sufficiently 
strong  result  to  show  that  the  nrany-worlds  interpretation  recovers  all  of  the  important  aspects 
of  quantum  probability.  There  remain  two  key  problems,  identified  in  Papineau  ([1996]);  Greaves 
([2007a]).  First,  the  practical  problem.  When  faced  with  decisions  whose  repercussions  depend 
upon  the  outcomes  of  future  quantum  measurements,  why  should  agents  act  as  they  would  if 
only  one  outcome  were  going  to  occur  with  probability  determined  by  the  Born  rule?  Second, 
the  epistemic  problem.  Why  can  we  infer  facts  about  the  wave  function  from  observed  long-run 
frequencies?  Also,  why  do  the  data  usually  taken  as  evidence  for  quantum  mechanics  provide 
evidence  for  Everettian  quantum  mechanics?  As  we  are  primarily  concerned  with  our  reasons  for 
believing  in  the  many-worlds  interpretation,  we  focus  on  the  epistemic  problem.  The  practical 
problem  is  less  urgent.  If  Everettian  quantum  mechanics  is  well-confirmed  by  the  evidence  but 
turns  out  to  recommend  that  we  act  differently,  then  we  would  do  well  to  adjust  the  way  we 
make  decisions.  In  §5.1  we  suggest  that  perhaps  there  is  no  adjustment,  that  an  Everettian  can 
continue  to  act  as  if  a  single  outcome  will  occur  with  probability  given  by  the  Born  rule. 

A  common  objection  to  Everettian  quantum  mechanics  is  that  there  is  no  way  for  probabilities 
to  arise  in  a  deterministic  theory  when  the  entire  physical  state  and  the  laws  are  known.  Consider 
Once-OR-Twice.  At  ti  before  the  measurements  have  been  made,  Alice  knows  what  will 
happen.  She  will  have  a  successor  who  sees  that  particle  a  was  measured  to  be  down  and  two 
who  see  up.  If  she  knows  the  universal  wave  function,  there  is  nothing  for  her  to  be  uncertain 
of.  Of  course,  as  was  discussed  in  §2.2,  she  will  experience  self-locating  uncertainty  at  £2  and  £3. 
But,  at  ti  there  is  no  uncertainty.  Still,  it  may  be  that  Alice  should  act  as  if  she  was  uncertain 
(perhaps  there  is,  as  Vaidman  [2011],  [2014a]  would  say,  an  ‘illusion  of  probability’).  Regardless 
of  how  Alice  is  to  act  at  t\,  when  the  data  is  collected  and  the  theories  are  tested,  between  1 3 
and  £4,  the  requisite  uncertainty  is  present  and  confirmation  can  proceed  as  usual.  Here  we 
follow  Greaves  ([2004])  in  admitting  that  Everettian  quantum  mechanics  lacks  pre-measurement 
subjective  uncertainty  but  arguing  that  it  is  not  thereby  refuted.28  The  core  problem  of  this 
section,  whether  post-measurement  pre-observation  probabilities  are  sufficient  for  solving  the 
practical  and  epistemic  problems,  has  been  discussed  by  Tappenden  ([2011])  and  we  are  largely 


28 Although  we  will  not  explore  the  option  here,  one  could  try  to  revive  some  notion  of  pre-measurement 
uncertainty  which  would  be  present  even  in  cases  where  the  wave  function  is  known;  strategies  include  conceiving 
of  persons  as  four-dimensional  worms,  taking  Everettian  worlds  to  be  diverging  as  opposed  to  overlapping,  and/or 
trying  to  most  charitably  interpret  words  like  ‘uncertainty’  as  uttered  by  agents  who  unknowingly  reside  in  an 
Everettian  multiverse  (see  Saunders  &  Wallace,  [2008];  Saunders,  [2010b];  Wilson,  [2012];  Wallace,  [2012],  ch. 
7).  Our  proof  of  the  Born  rule  post-measurement  could  potentially  be  combined  with  such  an  account  to  justify 
setting  the  pre-measurement  probabilities  in  alignment  with  the  Born  rule.  However,  it  is  not  clear  that  true 
pre- measurement  uncertainty  is  needed  to  solve  the  practical  or  epistemic  problems.  Although  there  may  be  some 
way  of  making  sense  of  pre-measurement  probabilities,  we  hope  to  show  that  our  approach  does  not  rely  on  the 
success  of  such  a  program  (here  we  adopt  the  strategy  of  Greaves,  [2007a],  §1.3). 
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in  agreement  with  his  conclusions. 


5.1  Betting  and  Branching 

Imagine  that,  in  Once-OR-Twice,  at  t\  Alice  is  offered  a  bet  which  costs  $20  and  pays  $50 
if  the  first  measurement  yields  down,  nothing  if  up.  The  net  reward  or  cost  is  assessed  at  1 4. 
Should  Alice  accept  the  bet?  At  t?-  all  of  her  successors  will  wish  that  she  had  taken  the  bet. 
They  each  assign  a  probability  of  0.5  to  up  and  0.5  to  down,  so  the  expected  value  of  the  bet 
is  $5.  At  £3,  the  expected  value  of  the  bet  is  the  same  and  again  all  of  Alice’s  successors  will 
wish  she  had  taken  it.  Alice  knows  ahead  of  time  that  although  she’ll  gain  no  new  information 
about  the  outcome  as  time  progresses,  once  the  measurement  has  occurred  she  will  wish  she’d 
taken  the  bet.  So,  it  seems  reasonable  that  at  t\  she  should  gamble  in  the  way  her  future  selves 
will  wish  she  had  and  accept  the  bet.  Generalizing  this  reasoning,  Alice  should  always  bet  as 
if  one  outcome  were  going  to  occur  with  probability  given  by  the  Born  rule  since  during  the 
inevitable  post-measurement  pre-observation  period  all  of  her  successors  will  approve  of  choices 
made  under  this  supposition.  (This  type  of  argument  is  presented  and  assessed  in  Tappenden, 
[2011].) 

When  Alice  decides,  pre- measurement,  to  accept  the  bet,  she  is  not  doing  so  because  she  is 
trying  to  make  a  decision  under  uncertainty.  Instead,  she  is  trying  to  make  a  decision  about  how 
to  distribute  goods  among  her  successors.  If  she  distributes  goods  as  recommended  above,  each 
successor  will  think  the  decision  reasonable  before  they  come  to  know  which  successor  they  are. 
Once  they  come  to  realize  which  branch  they’re  on,  they  may  well  not  endorse  the  bet.  This 
should  not  be  surprising.  Even  the  most  careful  gambler  can  lose  and  in  such  cases  would  prefer 
that  the  bet  was  never  made. 

The  strategy  for  arriving  at  effective  pre-measurement  uncertainty  outlined  above  has  three 
distinct  shortcomings.  First,  the  claim  that  one’s  pre-measurement  decisions  must  align  with 
the  post-measurement  preferences  of  one’s  successors  must  be  justified.  This  alignment  might 
be  enforced  by  some  sort  of  decision-theoretic  reflection  principle,  but  such  a  principle  would 
need  to  be  precisely  stated  and  defended.29  Second,  this  strategy  only  works  if  the  preferences  of 
agents  are  narrow:  each  successor  only  cares  about  what’s  happening  in  their  own  branch.30  If 
each  of  Alice’s  successors  only  cares  about  their  own  wealth,  then  the  bet  described  above  seems 
lucrative.  There  is  a  50%  chance  of  winning  $30  and  a  50%  chance  of  losing  $20.  The  probability 
of  being  on  a  certain  branch  acts,  for  all  practical  purposes,  like  the  probability  that  a  certain 


29Some  work  has  already  been  done  in  this  direction.  Wallace  ([2002],  §8.1)  introduces  a  decision-theoretic 
reflection  principle  which  is  discussed  in  Greaves  ([2004],  §4.2)  and  Greaves  ([2007a],  §5.2.1).  The  requirement  is 
roughly  that  one’s  betting  behavior  ought  not  change  over  a  period  of  time  in  which  no  new  evidence  is  gained 
(and  this  is  coupled  with  the  idea  that  the  occurrence  of  an  expected  branching  event  provides  no  new  evidence). 
Wallace  ([2010c], [2012])  defends  a  requirement  of  diachronic  consistency  which  does  similar  work.  Unfortunately, 
both  of  these  principles  condemn  agents  we  take  to  be  acting  rationally  in  various  cases  where  self-locating 
uncertainty  is  important  (such  cases  often  violate  epistemic  reflection  principles,  see  Lewis,  [2009a] ;  Arntzenius, 
[2003]).  For  example,  the  principles  condemn  an  agent  who  accepts  the  Duplicating  Dr.  Evil  Dutch  book  in 
appendix  A  as  the  agent’s  betting  behavior  changes  once  the  duplicate  is  created.  We  thus  believe  that  these  two 
principles  cannot  be  correct  as  stated. 

30Tappenden  ([2011])  is  aware  of  this  problem  (which  was  raised  in  the  context  of  anticipated  branching  by 
Price,  [2010]). 
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thing  is  happening.  But,  if  the  successors  care  about,  say,  the  average  wealth  of  all  successors, 
then  at  t2  it  seems  like  a  good  bet  but  at  t%  it  does  not.  (Here  we’ve  assumed  an  idealized  case,  as 
in  §2.3,  where  the  number  of  successors  is  well-defined.  Removing  the  idealization  makes  decision 
making  harder,  but  doesn’t  fix  the  problem  that  one’s  preferences  may  extend  beyond  the  goings 
on  in  one  particular  branch.  For  example,  one  might  conceivably  desire  that  no  copy  of  oneself 
elsewhere  in  the  multiverse  be  experiencing  a  truly  miserable  life  (Price,  [2010],  §7).)  Third,  the 
strategy  outlined  above  does  not  directly  address  cases  where  there  is  no  period  of  uncertainty 
at  all  because  observation  happens  immediately  upon  measurement.  We  believe  such  cases  are 
atypical,  but  perhaps  possible  (§2.2).  In  such  a  case,  one  can  use  the  strategy  above  to  argue 
that  had  there  been  a  period  of  uncertainty  of  any  length  at  all,  it  would  have  been  rational  to 
treat  the  psi-squared  branch  weights  as  probabilities  for  the  sake  of  decision  making — to  care 
about  one’s  successors  in  proportion  to  the  weights  of  the  branches  they  occupy.  As  the  limit  of 
the  amount  of  time  after  measurement  before  observation  is  taken  to  zero,  the  same  decisions 
remain  rational.  It  would  be  very  odd  if  they  were  not  the  right  actions  to  take  when  there  is 
no  period  of  uncertainty  at  all.31 

5.2  Theory  Confirmation 

For  the  purposes  of  empirically  testing  competing  theories — about  the  state  of  the  system  or  the 
laws  that  govern  its  evolution — it  is  necessary  that  when  the  data  come  in  we  can  judge  whether 
the  data  are  considered  probable  or  improbable  by  the  various  theories  under  consideration. 
That  is,  immediately  before  the  agent  observes  the  result,  probabilities  of  different  outcomes 
must  be  well-defined.  However,  it  is  inessential  that  there  be  well-defined  probabilities  before 
the  measurement  is  made  and  the  outcome  is  recorded  by  the  measuring  device  (the  point  at 
which  the  wave  function  branches) .  Post-measurement  pre-observation  probabilities  are  sufficient 
for  theory  confirmation.  On  our  account,  immediately  before  looking  at  the  outcome  of  an 
experiment  an  agent  in  an  Everettian  multiverse  is  uncertain  of  what  the  observed  outcome  will 
be  and  is  perfectly  capable  of  quantifying  that  uncertainty.  Theories  can  be  tested  in  familiar 
ways. 

Although  theory  testing  in  Everettian  QM  proceeds  essentially  as  usual,  we  discuss  two  kinds 
of  learning  scenarios  here  for  the  sake  of  illustration.  First,  consider  testing  theories  about  the 
wave  function  of  the  system  by  gathering  data  about  which  eigenvalues  are  measured.  We  will 
treat  the  problem  from  a  Bayesian  perspective,  but  one  could  apply  alternative  methods  (see 
Wallace,  [2012],  §6.2  &  6.3).  For  simplicity,  consider  a  case  where  there  are  just  two  theories 
under  consideration: 

What  Wave  Function?  Alice  will  measure  the  z-spin  of  a  single  particle  which  she  knows  to 


31A  similar  limiting  argument  is  made  by  Greaves  ([2007a],  §5.2.1),  and  another  appealing  to  a  hypothetical 
period  of  uncertainty  by  Tappenden  ([2011],  §4) 
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be  prepared  in  either  state  ‘Pf’  (‘probably  up’)  or  state  ‘Pj,,’ 


I^Pt)  = 
I^Pt)  = 


(43) 


Before  the  experiment,  she  considers  either  wave  function  equally  likely.  Let  t\  be  a  time 
after  preparation  before  measurement,  £2  be  post-measurement  pre-observation,  and  f3  be 
immediately  post-observation. 


On  our  account  it  is  not  hard  to  see  that  observing  an  up  result  confirms  At  t2,  Alice  assigns 
conditional  probabilities  Pt2  (tzI^Pt)  =  0-9  and  Pt2  (tzI'J'Pj.)  =  0.1.  As  she  knows  branching 
has  occurred,  these  are  the  probabilities  she  assigns  to  being  on  an  up  branch  conditional  on 
particular  initial  wave  functions.  If  upon  observation  she  sees  that  the  spin  was  up,  she  should 
update  her  credence  in  4' p-j-  by  conditionalizing  on  her  new  evidence, 


_ Pt2  (tzl'frpf)  pt2  (ifrpt) _ 

Pt2  (tzl'hp-t-)  Pt2  (’J'pf)  +  Pt2  (-tz\*Pl)Pt2  ('Fpj.) 

_ 0-9x0-5 _ =  0  9 

0.9  x  0.5 +  0.1  x  0.5 


(44) 


The  credence  Alice  assigns  to  ’Up-)-  jumps  from  0.5  to  0.9,  as  it  should.  Continuing  to  observe 
more  up  results  than  down  in  identically  prepared  systems  would  further  confirm  the  ’Lpi  hy¬ 
pothesis  over  Tp^..  In  analyzing  this  case  we’ve  assumed  that  Alice  should  update  as  usual,  by 
(44),  even  though  some  of  the  probabilities  involved  quantify  her  self- locating  uncertainty,  like 
Pt2  (tzl’l'pt).32 

Next,  consider  a  second  type  of  learning  scenario.  We  can  empirically  confirm  Everettian 
quantum  mechanics  over  competing  empirically  inequivalent  hypotheses  about  the  physical  laws 
by  observing  Born  rule  statistics.  Suppose  we  run  a  large  number  of  experiments  without  ob¬ 
serving  the  outcomes  (one  could  equally  well  look  after  each  experiment,  but  this  way  things  will 
be  a  bit  simpler) .  Some  sequences  of  results  would  be  deemed  likely  by  the  Born  rule  and  others 
would  be  judged  very  improbable.  Suppose  that  the  actual  sequence  of  outcomes,  is  one  that 
the  Born  rule  judges  likely.  For  example,  imagine  that  of  a  large  number  of  measurements  of 
A  on  systems  prepared  in  the  same  state  |4/),  the  fraction  in  which  eigenvalue  a  is  observed  is 
approximately  |(a|’3/)|2.  Then,  the  probability  of  the  evidence  given  the  theory,  P(oS^|EQM&\l/), 
is  high  for  Everettian  quantum  mechanics,  EQM,  because  Everettian  probabilities  match  the 
Born  rule  probabilities  (§3.2).  If  another  theory  considered  such  sequences  to  be  less  probable, 


■i2There  is  a  way  in  which  the  case  of  theory  testing  here  is  different  from  normal  cases.  On  either  of  the  two 
hypotheses  about  the  wave  function,  it  was  guaranteed  that  a  version  of  Alice  would  see  spin  up.  This  leads  to 
a  worry:  What  Alice  learns  upon  seeing  up  is  that  ‘in  my  universe,  one  of  the  copies  saw  up’ — which  doesn’t 
provide  evidence  for  either  theory  over  the  other  as  Alice  already  knew  with  certainty  that  it  would  happen — and 
that  ‘I’m  the  one  who  saw  up’ — which  is  purely  self-locating  information  about  where  she  is  in  the  universe  and 
thus  not  informative  about  the  way  the  universe  is.  We  side  with  Titelbaum  ([2008])  and  Greaves  &  Myrvold 
([2010],  pp.  294-295)  in  rejecting  the  final  step,  ‘purely  self- locating  evidence’  can  be  informative  about  the  way 
the  world  is.  For  a  theory  about  learning  from  evidence  that  disagrees  on  this  point,  see  Meacham  ([2008]). 
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then  updating  on  5?  would  support  Everettian  quantum  mechanics  over  this  competing  theory. 
If  the  alternative  theory  assigned  the  same  probability  to  5? ,  the  data  would  not  discern  between 
the  two  theories. 

In  general,  confirmation  will  work  as  usual  in  cases  where  Alice  has  a  period  of  uncertainty 
before  the  outcome  is  revealed  to  her.  What  happens  if  in  What  Wave  Function?  Alice  sees 
that  the  particle  is  tz  exactly  when  the  branching  occurs  so  that  there  is  no  post-measurement 
pre-observation  period?  This  is  a  tricky  question.  Alice  might  try  to  update  her  credences 
immediately  after  branching  by  conditionalization  as  in  (44)  (now  moving  directly  from  the 
pre-measurement  time  t\  to  the  post-observation  time  tj,  as  there  is  no  post-measurement  pre¬ 
observation  period), 


pt3  (^Pt)  = 


_ pt,  (tz|^Pf)Ptl  (*Pf) _ 

Ptl  (tzI^Pt)  Pti  (*Pt)  +Ptl  (tzI^Pi)^  t*Pl)  ■ 


(45) 


In  this  equation,  the  probability  Ptl  (tzl^pf)  is  hard  to  interpret.  It’s  definitely  not  the  prob¬ 
ability  that  the  particle  was  up  given  41 p|  at  t,\  since  the  particle  was  in  a  superposition  of  up 
and  down  before  the  measurement  was  made.  It  cannot  be  Alice’s  pre- measurement  probability 
that  lI  will  see  up’  given  that  the  particle  is  currently  in  state  4/p^  since  the  claim  is  ill-defined; 
some  successors  will  see  up  and  others  will  see  down,  all  are  Alice’s  descendants.  It  should  not 
be  interpreted  as  the  probability  that  some  successor  of  Alice  sees  up.  In  cases  where  stochastic 
theories  are  under  consideration  this  proposal  would  make  Everettian  QM  too  easy  to  confirm 
all  outcomes  that  one  might  observe  are  given  probability  one  (see  Greaves,  [2007a],  p.  140). 
Standard  Bayesian  methods  break  down.  The  question  of  how  to  revise  Bayesian  confirmation 
theory  is  difficult  and  numerous  proposals  have  been  made — many  of  which  rely  on  Elga’s  indif¬ 
ference  principle  and  thus  are  incompatible  with  ESP  (see  footnote  2.2).  To  thoroughly  address 
the  question  of  how  to  update  when  the  outcome  is  immediately  observed  upon  branching,  we 
must  await  philosophical  progress.33  There  is  reason  to  be  optimistic.  Plausibly,  in  the  case 
of  immediate  observation  one  should  adjust  their  beliefs  in  the  way  they  would  if  there  were  a 
short  period  of  self-locating  uncertainty  as  the  adjustment  is  the  same  no  matter  how  short  the 
period  of  self-locating  uncertainty  is  (Tappenden,  [2011],  p.  107). 


6  Comparison  to  Other  Approaches 

The  main  purpose  of  this  article  is  to  present  our  derivation  of  the  Born  rule.  It  is  consistent  with 
the  success  of  our  derivation  that  other  attempts  to  derive  the  Born  rule  are  also  satisfactory. 
However,  there  would  have  been  little  motivation  to  embark  on  this  project  if  we  did  not  have 


330ne  appealing  recent  suggestion  is  to  replace  Pt1  (tzI^Pt)  with  r(\I/p^  — >T.z)  where  r(\I/p-f  -A  tz)  is  a 
‘transition  probability,’  capturing  something  like  the  degree  to  which  the  copies  of  Alice  on  branches  are 
Alice’s  successors  if  the  initial  wave  function  is  ^p^;  or,  more  operationally,  specifying  how  much  of  Alice’s 
credence  in  \I/ p^  should  shift  to  the  fz  branches  at  £3  before  she  takes  account  of  any  new  evidence  (Schwarz, 
[forthcoming]).  If  Schwarz’s  suggestion  is  adopted,  one  could  argue  that  the  transition  probabilities  should  be 
given  by  the  amplitude-squared  of  the  wave  function  by  requiring  that  belief  update  when  there  is  no  period 
self-locating  uncertainty  agree  with  the  case  of  an  arbitrarily  small  period  of  self-locating  uncertainty. 
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concerns  about  existing  approaches.  In  this  section  we  will  highlight  the  differences  between 
our  derivation  and  two  existing  programs:  Zurek’s  envariance-based  derivation  and  the  decision- 
theoretic  approach. 

6.1  Zurek’s  Envariance-based  Derivation  of  the  Born  Rule 

Here  we  will  briefly  present  Zurek’s  ([2005])  argument  in  the  simplest  case  and  mention  some  of 
the  limitations  of  his  approach. 

According  to  Zurek,  the  probabilities  for  a  system  S  to  manifest  various  properties  upon 
measurement  depend  only  on  the  state  of  the  system  (‘Fact  2’).  Further,  the  state  of  a  system  is 
not  affected  by  unitary  transformations  on  the  environment  (‘Fact  1’).  From  these  assumptions, 
one  can  prove  that  the  probabilities  for  different  measurement  outcomes  are  in  agreement  with 
the  Born  rule  in  ordinary  scenarios.  For  example,  suppose  the  state  of  the  universe  is 

^(ltz>s|£i}  +  |l*>s|£2>)  ,  (46) 

where  E\  and  E2  are  orthogonal.  It  is  taken  to  follow  from  the  perfect  entanglement  between 
system  and  environment  that  P(f2)  =  P{Ef).  The  following  universal  state  can  be  reached  by 
a  unitary  swap  of  the  environment  states  (or  a  unitary  swap  of  the  system  states), 

^(iWsl^  +  IUsl^a))  ■  (47) 

This  swap  does  not  change  the  state  of  the  system  or  the  environment,  it  only  affects  the  entan¬ 
glement  between  the  two.  The  perfect  entanglement  of  (47)  yields  P(iz)  =  P{Ef).  Combining 
these  two  results  gives  P(fz)  =  P(-lz)>  in  agreement  with  the  Born  rule.34 

Zurek  has  provided  a  compelling  argument  that  the  Born  Rule  gives  the  only  sensible  prob¬ 
ability  measure  in  Everettian  quantum  mechanics.  However,  that  has  arguably  already  been 
established  by  Gleason’s  Theorem  (Gleason,  [1957]).  What  Zurek  fails  to  explain  is  how  proba¬ 
bilities  arise  at  all  in  this  deterministic  theory — self-locating  uncertainty  is  not  discussed.  Thus 
in  his  derivation  the  nature  of  the  probabilities  involved  is  obscure.  Zurek  claims  that  he  is 
calculating  probabilities  for  future  measurement  outcomes,  but  he  does  not  explain  clearly  what 
these  are  probabilities  of,  saying  that  the  ‘observer  can  be  ignorant  of  his  future  state,  of  the 
outcome  of  the  measurement  he  has  decided  to  carry  out’  (Zurek,  [2005],  §VII.C).35  This  would 
be  reasonable  if  only  one  outcome  were  expected  to  occur.  However,  as  explained  in  §2.2,  it  is 
not  obvious  that  there  is  anything  for  one  to  assign  probabilities  to  in  the  many-worlds  interpre¬ 
tation  when  all  outcomes  will  occur  and  the  future  evolution  of  the  wave  function  is  known.36 

34In  our  exposition  of  (Zurek,  [2005],  §11. C)  we  take  option  3  since  it  seems  strongest  and  is  most  similar  to 
the  reasoning  in  our  derivation. 

35 See  also  FAQ  #5  of  (Zurek,  [2010]). 

36 Zurek  ([2005],  §III.C)  recognizes  this  challenge  and  responds  by  departing  from  the  many-worlds  interpretation 
and  discussing  his  Existential  interpretation  instead.  This  raises  doubts  about  the  applicability  of  his  proof  to 
Everettian  quantum  mechanics. 

Wallace  ([2012],  ch.  7,  esp.  §7.6)  argues  that  we  should  expand  our  concept  of  subjective  probability  so  that 
one  can  not  only  assign  probabilities  to  ways  the  world  might  be  and  locations  they  might  have  in  it,  but  also 
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One  could  reasonably  argue  that  Facts  1  and  2  are  insufficiently  motivated  in  Zurek’s  treat¬ 
ment,  especially  given  that  the  probabilities  involved  are  not  the  usual  sort.37  Although  our 
derivation  relies  on  an  assumption  as  well,  ESP-QM  (which  is  similar  to  Facts  1  and  2  taken 
together),  we  have  attempted  to  provide  a  more  thorough  justification  of  our  assumption  and  a 
more  philosophically  careful  treatment  of  the  probabilities  involved. 

6.2  The  Decision-theoretic  Program 

Starting  with  (Deutsch,  [1999]),  there  have  been  a  variety  of  attempts  to  justify  Born  rule 
probabilities  using  decision  theory  (Greaves,  [2004],  [2007a];  Greaves  &  Myrvold,  [2010];  Wallace, 
[2003b];  [2010c],  [2012];  Wilson,  [2013]).  The  basic  strategy  is  to  argue  that  plausible  constraints 
on  rational  preferences  ensure  that  when  we  make  bets  about  the  outcomes  of  future  quantum 
measurements  we  will  act  as  if  we  assign  Born  rule  probabilities  to  the  various  branches.  The 
success  of  their  program  is  compatible  with  the  success  of  our  purely  epistemic  arguments.  In 
fact,  the  mathematical  methods  used  to  derive  the  Born  rule  are  quite  similar.38  At  this  point, 
many  remain  unconvinced  by  the  decision-theoretic  program  (Baker,  [2007];  Albert,  [2010];  Price, 
[2010];  Dizadji-Bahmani,  [forthcoming];  Maudlin,  [2014]).  We  focus  on  one  particular  type  of 
concern  here  to  highlight  an  advantage  of  our  approach. 

The  constraints  proposed  in  the  decision-theoretic  approaches  may  reasonably  be  doubted  as 
they  manifestly  conflict  with  a  prima  facie  reasonable  epistemic  principle,  Indifference.  Deutsch’s 
original  formulation  of  the  decision-theoretic  argument  for  the  Born  rule  implicitly  appeals  to 
the  following  assumption  (Wallace,  [2003b]): 

Measurement  Neutrality.  A  rational  agent  is  indifferent  between  any  two  quantum  bets 
that  agree  on  the  state  |\t)  on  which  the  measurement  is  to  be  performed,  the  observable 
X  to  be  measured,  and  the  ‘payoff  function’  P  from  the  spectrum  of  X  to  the  set  of 
consequences.  (Greaves,  [2007b],  p.  119) 

Wallace  ([2007])  has  proved  that  Measurement  Neutrality  is  equivalent  to: 

Equivalence.  A  rational  agent  is  indifferent  between  any  two  quantum  bets  that  agree, 
for  each  possible  reward,  on  the  mod-squared  measure  of  branches  on  which  that  reward  is 
given.  (Greaves,  [2007b],  p.  119) 

The  arguments  in  (Deutsch,  [1999];  Greaves,  [2004],  [2007b];  Wallace,  [2003b])  appeal  to  this 
assumption  in  one  of  its  manifestations.  Measurement  Neutrality  may  have  some  intuitive  plau¬ 
sibility  but,  looking  at  Equivalence,  it  is  obvious  that  this  assumption  will  be  inconsistent  with  a 
branch-counting  rule  for  assigning  probabilities.  The  principle  seems  to  beg  the  question  against 


to  different  futures  within  a  branching  Everettian  multiverse.  Doing  this  would  help  preserve  the  correctness 
of  language  used  by  agents  in  Everettian  worlds.  Wallace  may  be  right  that  such  an  extension  is  possible  and 
advisable,  but  to  the  extent  that  it  is,  it  requires  a  significant  change  in  the  way  we  understand  probability.  In 
this  new  context,  fact  2  is  far  from  obvious.  Wallace’s  derivation  of  the  Born  rule  is  not  susceptible  to  similar 
concerns  as  his  assumptions  are  about  the  preferences  of  ideal  agents,  not  their  probability  assignments. 

37See  also  (Schlosshauer  &;  Fine,  [2005],  esp.  §III.F2;  Albert,  [2010],  §3.1). 

38 Compare  Wallace’s,  [2012],  §5.5,  use  of  erasure  to  ignore  irrelevant  details  and  our  use  of  ESP-QM  in  §3.2  to 
do  the  same. 
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the  defender  of  Indifference  who  takes  branch-counting  to  be  the  proper  way  to  assign  proba¬ 
bilities  in  Everettian  quantum  mechanics.  Defenders  of  the  decision-theoretic  program  typically 
point  out  that  the  conflict  with  branch-counting  should  not  cause  doubt  about  Equivalence  since 
there  are  independent  reasons  to  think  branch-counting  is  wrong.  Two  main  reasons  are  given. 
First,  branches  cannot  generally  be  counted  so  branch-counting  is  not  a  rule  one  could  actually 
apply.  As  discussed  in  §2.3,  there  are  cases,  albeit  somewhat  contrived,  where  branch  number 
is  well-defined  and  in  these  cases  branch-counting  gives  definite  recommendations  which  are  in 
conflict  with  the  Born  rule.  The  fact  that  it  is  unclear  how  to  apply  a  rule  in  some  cases  is 
not  necessarily  a  reason  to  think  it  incorrect  when  it  yields  a  clear  judgment.  Second,  branch¬ 
counting  is  diachronically  inconsistent.  We  do  not  take  this  to  be  a  strong  reason  to  reject 
branch-counting  (see  appendix  A). 

In  more  recent  decision-theoretic  derivations  of  the  Born  rule,  Wallace  ([2010c],  [2012])  intro¬ 
duces  a  somewhat  different  collection  of  rationality  axioms,  multiple  of  which  might  be  contested 
by  the  defender  of  branch-counting.  Wallace  argues  in  favor  of  these  axioms  and  does  not  take 
their  conflict  with  branch-counting  (Wallace,  [2012],  §5.8.1)  as  a  reason  to  think  them  incorrect, 
since  branch-counting  can  be  shown  to  be  irrational.  For  one  who  believes  his  axioms,  this 
gives  reason  to  doubt  branch-counting.  But  for  the  proponent  of  Indifference ,  it  gives  reason 
to  doubt  Wallace’s  axioms.  We  also  reject  Indifference  because  it  conflicts  with  an  assumption 
about  rationality  ( ESP ).  However,  we  take  our  argument  to  be  potentially  more  persuasive  to 
the  proponent  of  Indifference  for  two  main  reasons:  ESP  is  a  single,  simple,  general  epistemic 
principle  which  we  believe  has  similar  (if  not  more)  initial  plausibility  than  Indifference ;  ESP 
is  consistent  with  Indifference  (and  Strong  ESP  entails  it)  in  cases  of  classical  self-locating  un¬ 
certainty  and  thus  it  is  capable  of  explaining  why  (and  how)  quantum  cases  are  to  be  handled 
differently  (though  see  footnote  19).  A  strength  of  our  approach,  over  both  the  decision-theoretic 
program  and  Zurek’s  derivation,  is  that  it  can  be  applied  in  both  cases  of  classical  duplication 
and  in  cases  of  quantum  branching. 


7  Conclusion 

In  this  paper  we  have  presented  a  justification  for  the  Born  rule  in  Everettian  quantum  me¬ 
chanics  in  which  self-locating  uncertainty  played  a  fundamental  role.  The  policy  of  reacting  to 
self-locating  uncertainty  by  treating  each  observer  as  equiprobable,  Indifference ,  is  reasonable 
in  classical  scenarios  but  yields  strange  recommendations  in  quantum  contexts.  Instead  we  pro¬ 
posed  the  Strong  Epistemic  Separability  Principle  ( Strong  ESP),  which  extends  smoothly  from 
Indifference  in  the  classical  regime  to  the  Born  rule  in  cases  of  quantum  measurement. 

Following  the  consequences  of  Strong  ESP  leads  us  to  a  simple  and  physically  transparent 
derivation  of  the  Born  rule  in  the  many-worlds  interpretation.  The  appearance  of  probabilities 
in  a  deterministic  theory  is  explained  by  evolution  from  perfect  knowledge  to  unavoidable  self- 
locating  uncertainty.  Our  approach  provides  a  unified  perspective  on  uncertainties  in  both  the 
classical  and  quantum  contexts,  with  implications  for  large-universe  cosmology  as  well  as  for  the 
foundations  of  quantum  mechanics. 
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There  are  a  few  ways  in  which  future  work  could  strengthen  the  approach  to  probability 
in  Everettian  quantum  mechanics  developed  in  this  article.  First,  although  Strong  ESP  has 
some  intuitive  appeal,  one  could  seek  further  philosophical  justification  for  the  principle  and  a 
more  precise  formulation  of  it.  Second,  more  work  could  be  done  in  justifying  the  appeal  to 
reduced  density  matrices  as  the  correct  way  of  representing  subsystems  in  Everettian  quantum 
mechanics.  Third,  one  might  hope  to  reach  more  definitive  conclusions  about  the  connection 
between  post-measurement  probabilities  and  pre-measurement  decision  making.  Fourth,  the 
assumption  in  §2.2  that  branching  happens  globally  could  be  further  defended  in  the  context  of 
a  detailed  account  of  when  and  how  branching  occurs. 
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A  What’s  Not  Wrong  with  Branch-counting 

Wallace  ([2012],  §4.3)  gives  the  following  diagnosis  of  why  the  switch  from  Pt2(up)  =  t  to 
Pt3( up)  =  |  in  Once-OR-Twice  is  irrational:  it  is  diachronically  inconsistent.39  According  to 
‘the  standard  rules  of  the  probability  calculus,’  the  probability  of  particle  a  being  up  at  ought 
to  be 

Pt3(up  at  £3)  =  Pt2( up  at  ^3 1 up  at  t2)Pt2(np  at  t2)  +  Pt2(up  at  t3|down  at  t2)Pt2(down  at  £2)  • 

(48) 

But,  it  clearly  is  not  if 

2 

Pt3( up  at  t3)  =  - 

Pt2  (up  at  £3 1 up  at  t2)  =  1 
Pt2( up  at  £3 1 down  at  t2)  =  0 

Pt2(up  at  t2)  =  Pt2  (down  at  t2)  =  ^  ,  (49) 

,i9He  credits  Deutsch  with  giving  a  version  of  the  argument  in  conversation  (Wallace,  [2013],  footnote  15). 
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as  recommended  by  Indifference.  Without  the  subscripts,  the  point  is  certainly  valid.  At  £2  or 
£3  (or  any  other  time),  it  must  be  the  case  that 

P(up  at  f3)  =  P(up  at  t3|up  at  t2)P(up  at  tf)  +  P( up  at  t3|down  at  i2)P(down  at  tf)  ,  (50) 

provided  ‘up  at  tf  and  ‘down  at  f2’  are  mutually  exclusive  and  exhaustive  alternatives.  This 
forbids  certain  unsophisticated  ways  of  filling  in  the  branch-counting  story,  but  is  of  no  concern 
to  the  defender  of  Indifference.  At  i2,  P( up  at  i3)  is  only  1/2.  At  t3,  P(up  at  tf)  has  risen  to 
2/3. 

(48)  is  not  a  requirement  of  consistency  for  one’s  credences  at  a  time,  it  is  a  constraint  on  the 
way  one  ought  to  adjust  their  credences  over  time  and  not  one  that  a  proponent  of  Indifference 
should  accept.  At  i2  Alice  knows  that  there’s  a  50/50  chance  that  she’s  on  an  up  branch  and 
that  if  she  is  on  one  she’ll  definitely  still  be  on  one  later.  However,  once  the  second  measurement 
is  made  she  realizes  there  are  three  branches  she  might  be  on  and  that  none  of  her  evidence 
discerns  between  the  three  possibilities.  So,  by  the  logic  of  Indifference ,  she  should  think  it  more 
likely  that  she’s  on  an  up  branch  since  there  are  twice  as  many  of  them. 

Wallace  ([2012],  §5.4)  rightly  notes  that  because  of  the  prevalence  of  branching  in  Everettian 
quantum  mechanics,  such  credence  shifts  will  be  so  common  that  deliberative  action  will  be  near 
impossible.  Such  are  the  dangers  of  living  in  a  wildly  branching  multiverse. 

The  violation  of  (48)  can  be  made  more  worrisome  as  it  leads  to  an  Everettian  Dutch  book.40 
If  Alice  reasons  by  Indifference ,  there  are  a  series  of  bets  which  can  be  given  to  her  and  her 
successors  such  that  each  bet  will  be  judged  fair  but  the  combination  of  bets  guarantees  a  loss 
on  every  branch.  Here  is  a  way  of  constructing  the  Dutch  book  (from  Peterson,  [2011],  §2):  At  i2 
a  bet  is  offered  which  pays  $15  if  particle  a  is  down  and  — $15  if  up.  At  f3  a  bet  is  offered  which 
pays  $10  if  particle  a  is  up  and  — $20  if  down.  All  costs/rewards  will  be  collected/paid  at  £4.  If 
Alice  assigns  probabilities  as  in  (49),  these  bets  will  seem  fair.  However,  if  she  accepts  both  bets 
then  she  will  lose  $5  on  each  branch.  There  is  an  apparent  inconsistency  here.  Looking  forward 
from  t\  Alice  can  see  that  this  series  of  bets  will  guarantee  her  a  loss  and  thus  would  presumably 
choose  not  to  take  them  if  offered  as  a  package  before  the  branching  starts.  However,  as  she  goes 
through  the  experiment  and  her  credences  shift  (in  a  way  which  she  could  have  anticipated),  she 
finds  each  bet  fair  when  offered. 

The  defender  of  Indifference  should  not  be  perturbed.41  Consider  this  potential  Dutch  book 
for  Dr.  Evil  in  the  Duplicating  Dr.  Evil  case:  Before  t  (that  is,  before  the  duplication), 
Dr.  Evil  is  offered  a  bet  that  some  time  long  after  t  will  pay  out  $100  to  Dr.  Evil  and  —$300 
to  Dup.  Not  long  after  t.  he  is  offered  a  second  bet  which  will  pay  out  $200  to  Dup  and  —$200 
to  Dr.  Evil.  If  both  bets  are  accepted,  Dr.  Evil  and  Dup  will  each  lose  $100.  The  first  bet 
seems  lucrative.  Since  Dup  has  yet  to  come  into  existence,  Dr.  Evil  can  be  sure  he’s  not  the 
duplicate.  The  second  bet  is  fair  by  Indifference.  Thus  it  turns  out  that  accepting  Indifference 

40 Wallace  would  likely  see  this  as  a  friendly  strengthening  of  his  quick  argument  (see  Wallace,  [2012],  §4.8 
together  with  Wallace,  [2010b],  p.  247).  For  an  alternate  strategy,  see  (Wallace,  [2010b],  §11). 

41Thanks  to  David  Manley  for  essential  discussion  here.  See  (Lewis,  [2009a])  for  a  related  point. 
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in  the  original  case  where  the  principle  was  most  plausible  already  leads  to  a  diachronic  Dutch 
book.  The  defender  of  Indifference  finds  the  recommendations  of  the  principle  reasonable  in 
cases  like  Duplicating  Dr.  Evil  and  thus  must  reject  the  idea  that  this  kind  of  diachronic 
Dutch  book  demonstrates  irrationality.  The  Everettian  Dutch  book  discussed  above  raises  no 
new  concerns  for  Indifference. 

The  same  point  can  be  expressed  in  another  way.  If  the  fission  in  Once-OR-Twice,  depicted 
in  figure  1,  was  classical  fission  instead  of  quantum  fission  then  assigning  credences  in  accord  with 
Indifference  and  betting  accordingly  would  be  reasonable  (imagine  cases  of  amoeba-like  division 
as  discussed  in,  e.g.,  Parfit,  [1971]).  If  these  beliefs  and  betting  behaviors  are  acceptable  in  the 
classical  case,  why  not  in  the  quantum  case  as  well?  The  charge  of  diachronic  inconsistency  does 
not  raise  distinctively  quantum  problems  for  Indifference. 

B  Circularity 

There  are  reasons  to  be  concerned  that  an  appeal  to  reduced  density  matrices  in  deriving  the 
Born  rule  involves  some  sort  of  illegitimate  circularity  (Zeh,  [1997];  Baker,  [2007];  Schlosshauer, 
[2007],  §8.2.2;  Zurek,  [2003b],  [2005],  [2010];  Kent,  [2010],  §5.4).  This  objection  comes  in  a 
variety  of  forms.  We’ll  consider  three. 

First,  one  might  be  concerned  that  the  very  use  of  inner  products,  partial  trace  operations, 
and  reduced  density  matrices  is  forbidden  until  one  has  derived  a  connection  between  squared 
inner  product  and  probability.  This  cannot  be  right.  How  could  one  derive  such  a  thing  with¬ 
out  ever  writing  an  inner  product?  Partial  trace  operations  and  reduced  density  matrices  are 
perfectly  well-defined  mathematically  within  the  framework  of  quantum  theory.  Their  definition 
does  not  require  understanding  any  number  as  a  probability.  The  concern  about  circularity 
must  then  be  a  concern  about  the  physical  interpretation  of  these  mathematical  objects  (see  also 
Zurek,  [2010],  FAQ  #6). 

Second,  the  idea  that  the  reduced  density  matrix  describes  the  state  of  a  subsystem-  -which 
we  use  in  our  derivation  of  the  Born  rule — could  be  doubted. 

It  is  not  obvious  that  the  reduced  density  operator  for  system  A  is  in  any  sense  a  description 
for  the  state  of  system  A.  The  physical  justification  for  making  this  identification  is  that 
the  reduced  density  operator  provides  the  correct  measurement  statistics  for  measurements 
made  on  system  A.  (Nielsen  &  Chuang,  [2010],  §2.4.3) 

It  is  true  that  the  use  of  reduced  density  matrices  to  describe  subsystems  is  often  motivated  by  the 
fact  that  the  reduced  density  matrix  retains  enough  information  about  the  state  to  deduce  Born 
rule  probabilities  for  outcomes  of  measurements  performed  solely  on  that  system  (Schlosshauer, 
[2007],  §2.4.6;  Nielsen  &  Chuang,  [2010],  box  2.6).  However,  we  believe  an  alternative  justification 
is  possible  which  has  nothing  to  do  with  probabilities  or  measurements.  (We  only  give  a  first 
pass  at  such  a  justification  here.  In  particular,  we  do  not  show  that  reduced  density  matrices 
are  the  only  viable  way  to  mathematically  represent  the  state  of  a  subsystem.42)  In  §3.1  we 

42  Alternatively,  one  might  appeal  to  Zurek’s  envariance-based  argument  for  the  use  of  reduced  density  matrices 
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proposed  that  the  mathematical  representation  of  the  state  of  a  subsystem  should  (1)  together 
with  the  states  of  all  other  subsystems  and  facts  about  the  connections  between  the  subsystems 
yield  the  total  state,  and  (2)  be  sufficient  to  determine  its  own  evolution  when  the  subsystem 
is  isolated.  In  general,  the  reduced  density  matrix  for  a  composite  system  AB ,  pab,  cannot  be 
constructed  uniquely  from  the  separated  reduced  density  matrices  for  A  and  B.  The  matrix  pab 
also  encodes  facts  about  the  entanglement  between  A  and  B.  But,  as  these  are  just  facts  about 
how  A  and  B  are  connected,  condition  (1)  is  satisfied.  To  see  that  condition  (2)  is  met,  suppose 
that  at  least  for  a  time  subsystem  A  is  isolated  from  everything  else,  E.  Let  Ut  be  the  unitary 
operator  that  gives  the  time  evolution  of  the  total  state.  Since  A  and  E  are  non-interacting, 
Ut  =  UA®  Ue •  The  time  evolution  of  pa  is  then  given  by  U a  Pa  U\  (Ue  is  irrelevant). 

Third,  there  is  a  worry  that  we  cannot  assume  a  structure  of  branching  quantum  worlds 
without  assuming  squared-amplitudes  give  probabilities.  Suppose  that  the  reduced  density  ma¬ 
trix  for  a  macroscopic  system  is  approximately  diagonal.  Why  are  we  justified  in  treating  the 
diagonal  terms  as  branches  and  the  off-diagonal  terms  as  somehow  unimportant  and  certainly 
not  branches  themselves?  (See  Baker,  [2007];  brief  reply  in  Wallace,  [2012],  pp.  243-254.)  Our 
derivation  treats  the  probabilities  involved  as  purely  epistemic,  quantifying  an  agent’s  uncer¬ 
tainty  about  which  world  they  are  in.  The  question  of  how  patterns  in  the  wave  function  give 
rise  to  distinct  worlds — and  people  who  can  wonder  about  which  world  they  are  in — is  primarily 
metaphysical.  In  this  article  we’ve  offered  no  additional  insights  on  how  this  important  project 
is  to  be  completed,  choosing  to  work  under  the  assumption  that  it  can  be. 

It  should  be  noted  that  the  use  of  density  matrices,  while  convenient,  is  not  strictly  necessary 
for  our  derivation  of  the  Born  rule.  The  proof  of  the  Born  rule  could  equally  well  be  derived 
using  purely  the  language  of  state  vectors.  Instead  of  starting  from  the  idea  that  the  state  of 
the  environment  is  irrelevant,  we  could  begin  with  the  closely  related  thought  that  changes  in 
the  environment — represented  by  unitary  transformations — ought  not  change  the  probabilities 
one  assigns.  In  Carroll  &  Sebens  ([2014])  we  gave  a  derivation  along  these  lines. 

C  Generalization  of  the  Born  Rule  Derivation 

In  the  main  text  we  showed  how  one  can  derive  the  Born  rule  in  two  simple  cases.  Here  we  extend 
the  arguments  of  §3.2  to  the  more  general  case  of  N  orthogonal  branches  with  arbitrary  rational 
squared-amplitudes  and  arbitrary  phases.  As  the  proof  here  is  restricted  to  rational  squared- 
amplitudes,  one  might  be  concerned  about  whether  the  account  gives  any  advice  at  all  when 
the  squared-amplitudes  are  irrational.  This  is  especially  worrisome  as  there  are  more  irrational 
numbers  than  rational  numbers.  However,  if  we  assume  (plausibly)  that  the  probabilities  vary 
continuously  with  small  changes  in  the  amplitudes,  the  restricted  proof  is  sufficient.  For  any 
wave  function  with  irrational  squared-amplitudes  there  exist  arbitrarily  similar  wave  functions 
with  rational  squared-amplitudes  (as  the  rationals  are  a  dense  subset  of  the  reals). 


without  accepting  his  later  steps  concerning  probabilities  (Zurek,  [2005],  II. B). 
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A  general  state  of  the  above  form  can  be  written  as 

l*o>  =  I R)a  \di)D  \Ei)  +  | R)a  I d2)D  | E2)  +  ...  +  Cfew»  \R) A  \dN)D  \EN) 

N 

=  ^^eie«\R)A\dk)D\ek)E,  (51) 

k= 1 

where  each  ck  >  0,  c|  £  Z+,  9k  £  R,  and  T  =  ^J2k  cl-  I R)a  is  Alice’s  state  which  is  taken 
to  be  the  same  on  every  branch.  \dk)D  are  possible  states  of  the  detector  or  system  of  interest, 
D ,  the  entity  whose  states  Alice  is  assigning  probabilities  to.  | Ek)  are  the  possible  states  of  the 
environment.  Here  Alice  would  like  to  know  what  probabilities  to  assign  to  each  possible  state 
dk.  We  will  show  that  the  appropriate  probability  of  dk  is  (unless  dk  =  dj  for  some  j ,  a 
situation  where  the  particle  is  in  the  same  state  for  two  or  more  different  environments,  in  which 

case  the  probability  is  given  by  the  sum  of  the  amplitude-squared  for  each  j  such  that  dk  =  dj 

c2 

including  j  =  fc,  )A  ^4).  The  probability  that  Alice  ought  to  assign  to  each  pk  will  be  unchanged 
if  we  make  the  following  transformation  on  each  environment  state  (this  follows  from  ESP-QM, 
since  the  transformation  leaves  the  Alice+Detector  reduced  density  matrix  unaffected), 

l-^fc)  *  ~  { \Ek  i)  +  •■■  +  \E'k  2) }  .  (52) 

Cfe  t  ’  k  J 

The  environment  state  Ek  is  taken  to  a  superposition  of  different  environment  states.  These 
transformations  take  H/o  to 

N  cl 

I *eqamP)  =  E  E  Tei<>k  lR)A  ldk)D  1^)  '  (53) 

fc=l  j= 1 

In  this  state,  all  of  the  components  have  equal  amplitude.  Further,  there  are  terms  where 
D  is  in  state  dk  (unless  dk  =  dj  for  some  j,  in  which  case  there  are  JA  c|  terms).  Presently 
we  will  prove  that  each  component  of  T eqamp  has  equal  probability.  Looking  at  the  number  of 
terms  with  each  dkl  it  is  clear  that  this  is  all  that  is  needed  to  show  that  the  probabilities  in  H/o 
for  each  dk  are  given  by  the  Born  rule.  The  above  method  for  moving  from  a  state  of  unequal 
amplitudes  to  one  of  equal  amplitudes  is  also  used  in  (Zurek,  [2005],  §11. D). 

We’ve  reduced  the  problem  to  showing  that  terms  in  equal  amplitude  superpositions  are 
equiprobable.  A  general  state  with  equal  amplitude  terms,  such  as  ^ eqamp,  can  be  written  as 

N  ^ 

I*)  =  E  ^ 6k  I R)a  I dk)D  | Ek)  .  (54) 

Here  some  states  dk  may  be  identical.  Now  consider  two  possible  transformations  which  leave 
the  Alice+Detector  reduced  density  matrix  unchanged.  First,  consider  a  transformation  which 
entangles  N  display  screens  with  each  environment  state  Ek.  The  symbol  on  the  first  screen, 
symbol  1,  may  be  either  S 1  or  S[,  maybe  either  or  <0,  symbol  2  might  be  S2  or  S'2,  etc.  The 
first  transformation  entangles  the  fcth  state  with  N  displays,  most  of  which  show  the  unprimed 
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symbols  but  the  fcth  display  shows  the  primed  symbol: 


\Ek)  — ►  |5r)  |52) ...  |S0  ...  \SN)  \E*k) 


(55) 


This  takes  the  state  to 

l*a>  =  E  -^el9k  I  R)a  \dk)D  ISi)  -  |^>  -  I SN)  I  El)  .  (56) 

The  second  transformation  gives  most  environment  states  the  same  set  of  symbols,  except  the 
iVth  state  which  gets  all  of  the  primed  symbols, 

(  |Sr)  \S2)  \SN)  \E*k*)  iik^N 

\Ek)  — ►  l  .  (57) 

{  \S[)\S'2)...\S'N)\E*k*)  iik  =  N 

This  takes  T  to 

N-l 

l^>  =  E  -^eiSk  I R)a \dk)D  |5i)  \S2)  ...  | SN)  | ED 

+  -±=eie»  | R)a  \dN)D  |Si)  \S'2) ...  | S’N)  | E%)  .  (58) 

Now  we  use  similar  techniques  as  in  §3.2  to  show  that  the  probabilities  of  each  component  of  \I/a 
are  the  same  as  the  probability  of  the  iV-th  component  of  'frp.  Focusing  on  the  reduced  density 
matrix  of  Alice+Display  k.  we  see  that 

P{dk\^a)  =  P{dN\A>p)  .  (59) 

Combining  these  results  for  all  k,  we  see  that  the  probability  of  each  term  in  'kQ,  must  be  equal 
and  thus  that  the  probability  of  each  dk  in  state  T  is  given  by  the  Born  rule  (because  (56), 
and  \k,  (54),  agree  on  the  Alice+Detector  reduced  density  matrix).  In  combination  with  the 
discussion  in  the  previous  paragraph,  this  completes  the  proof  that  the  probabilities  for  different 
states  dk  in  the  general  state  Tq  are  given  by  the  Born  rule. 
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